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BOUNDS FOR THE CHARACTERISTIC VALUES 
OF MATRIX FUNCTIONS 
By 8. N. AFRIAT (Ozford) 
[Received 13 January 1950] 

1. Introduction 

For any square complex matrix of finite or infinite order, a pair of non- 
negative real numbers, called the wpper and lower moduli of the matrix, 
is introduced. These are upper and lower bounds for the moduli of the 
characteristic values of the matrix. They generalize the concept of the 
modulus of a complex number, and satisfy certain inequalities for sums 
and products. 

Conditions for the convergence of infinite matrix sums and products 
are deduced, and it is shown that an upper bound for the moduli of the 
characteristic values of a matrix function, formed from sums and pro- 
ducts in a set of matrix arguments, is the corresponding scalar function 
of the upper moduli of the arguments. 

2. DEFINITION. The upper and lower moduli \a\*, |\a|, of a matrix a 
are the non-negative real numbers whose squares are the maximum and 
minimum characteristic values respectively of the non-negative definite 
Hermitian matrix A'a. 

3. THEOREM. [f « is a characteristic value of a, then 

laly < lal < [al*. 

Let x*, x, be unit latent vectors of 4’a corresponding to the charac- 
teristic values |a|**, |a|? respectively. Then the Hermitian form &’A’ax, 
subject to the condition X’x = 1, has maximum and minimum values 
ja|**, |a|Z attained at x*, x, respectively, as is clear by the method 
of Lagrangian multipliers. Thus 

lal? < &’@’ax < |a|*?, when X’x = 1. 

Now let u be a unit latent vector of a corresponding to a characteristic 
value «. Then au = au, and @0’4’au = aoil’u = aa since D’u = 1; 
whence, taking x = u above, 

jal < da < |a|*. 

4, THEOREM. For any matrices a, b and a scalar matrix k (= k1 

where k is any scalar), 


|k|* = |k|, = ||; |ka|* = |k\\a|*, = | kal, = |A||al,; 
ja~*|* = (lal,)*, — [a*l ye = (lal*)?5 
|a|,/b|, < |ab|, < |ab|* < |a|*{b|*; | |a+b|* < |a|*+|b/*. 


Quart. J. Math. Oxford (2), 2 (1951), 81-4 
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The first two lines are clear. To consider the third, put c = ab, 
and let w be a unit latent vector of ¢’c corresponding to the charac- 
teristic value |c|**. Write 

E. = ax, , = bx, C. = cx, 
where x is a column vector. Then, by the argument indicated in § 3, 
G2 be < Swbu = lel**, if X’x <1; ° 
Wen, <|b|", if Xx <1; 
and, for any real number k, 
ge. <Pilal* if2’x < Fk. 


Now C, = abx = an, = §&,,; 
and, since we now have 
Wien, < |bl*, if Xx <1, 
and Ene one < |a|*?|b|**, if iin, < < |b|*, 


it follows that CC. < |al**|b|/**, if ’x <1. 
Hence, taking x = w, we get |ab|* < |a|*|b|*; and similarly 
|a|,|D|, < |ab},. 
Finally, put c = saga Then, if X’x < 1, 
= FE. + Eo net Heb +H Ne 
< FE. + 2(E, G2) (Hie Ne)! +H Ne 
< (|a|*+ |b|*)?, 
by Cauchy’s inequality. Hence |a+b|* < |a|*+ |b|*. 
4.1. CoroLLaRy. For any matrix sequence aj, ay,..., and for any n, 
finite or infinite, 


TI als < 





n n * 
Ila), <| 1% 


n 
3a < 2 |a,|*. 
v=1 v=1 


Proof is by induction for every finite n, and then letting n > oo. 











n 
<I] |a,|*, 
v=1 








5. Lemma. A matrix tends to the null matrix as its upper modulus 
tends to zero: a> 0 as \a|* > 0. 

By § 3, 0 < x’a’ax < |a|*? 
when X’x = 1. Thus the field of values for the Hermitian matrix 4’a | 
is bounded by |a|**, its maximum characteristic value. 

Now the vth diagonal element of 4’a is 4, a,, where a, is the vth 
column of a and 4, the vth row of 4’. It is a value of &’a, attained for 
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the vth coordinate vector: that is, the vector which has unity for the vth 
element and all other elements zero. 
Hence, for each v, 0< i&,a, < |al*?. 


In other words |a|** bounds the sums of the squares of the elements in 
any column of a, from which the lemma follows. 


6. THEOREM. For any matrix sequence a,, @,..., 


(i) 


= 


,=0 if limja,|* <1; 


z 
I| 
ar 


IMs 


= 
| 
— 


(ii) 


@ 
a, is convergentif > |a,|* is convergent; 
v=] 


ios 


(iii) I (1+a, ) is convergent if Il (1+-|a,|*) is convergent. 
v=1 


2 
ll 


These are consequences of the corollary of § 4.1 and the lemma of § 5. 
Thus, if lim |a,|* < 1, then, on the corollary, 


4 aim | © cs) 
<|IIa) <I] la|* = 
|jv=1 


v=1 
@ 


and then, by the lemma, li a, = 6, 
v=1 


I. 





which proves the first result. 


For the second result, suppose that > |a,|* is convergent. Then the 
v=1 


partial sums form a Cauchy sequence, so that, by use of the corollary 
in § 4.1, 


n * n 
> a| < > |a|*>~0 (mn>o). 
+1 


iv=m+1 | v=m 


Then, by the lemma, it follows that 





0< 





nr 
> a,>0 (m,n>o); 
v=m+1 


and convergence for the matrix sum follows by the general principle of 
convergence applied to each of its element sums. 
To prove the last part, let 
P,=[] (i+a,), Pr= I (1+ |a,|*); 


v=1 


and for m < n, let 
Prin = PalPmr  Pinn = PalPm: 
Then it follows from the theorem of § 4 that 
[Pan —1|* < Pinn—1; 
and further, P,—P,, = P,(Pan—1), 
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so that IP, —P,,|* < P*(P*,,—1). 
Now suppose the convergence: 
P*+P* (n>); 


so that P*,2>1 (m,n>0o). 
Then [P,—Pinl*>0 (m,n 00); 
and hence P,—P,,>90 (m,n—> 0); 


and the convergence of P, (n - 00) follows from the general principle 
of convergence applied to each of the element sequences. 


7. THEOREM. For a matrix function f = f(a,, ay,...), formed from sums 
and products in @ set @, @,... of matrix arguments, if f* = f *(a,, ag,...) 
is the corresponding scalar function in the upper moduli |\a,|*, |a|*.... of 
the arguments, then f£ is defined whenever f* is, and, if ¢ is any charac- 
teristic value of f, then 0 < |4| <f*. 

Writing f as a sum 5 p, of products p, = p,(a,, ag,...) in the matrices 

v 


A,, A,..., let p* = p*(\a,|*, |a,|*,...) be the corresponding product in the 
scalars |a,|*, |a,|*,..., and write f* = f*(a,, a,,...) for the scalar sum 


2 Pye 


Then, by § 4.1, If|* <> Ip |* <> pF, 
and hence ifi* < f°. 
Hence, by § 3, for any characteristic value ¢ of f, 
0</¢| <f*. 


That a series expressing the function f is convergent whenever a 
corresponding one for the function f* is convergent, is clear from the 
theorem of § 6. 

The origin of this note is in seeking a generalization for the theorem 
of Oldenberger’s (1) that the infinite power of a matrix is null if and only 
if the moduli of its characteristic values are all less than unity. This is 
partially achieved in the first result of § 6. Criteria for the convergence 
of certain numerical iterations can be established by using this result. 

The theorem of § 3 has been previously proved on different lines by 
Browne (2). 

The author’s thanks are due to the referee for some helpful criticism. 


REFERENCES 
1. R. Oldenberger, ‘Infinite powers of matrices and characteristic roots’, Duke 
Math. Journ. 6 (1940), 257. 
2. E.T. Browne, ‘The characteristic equation of a matrix’, Bull. American Math. 
Soc. 34 (1928), 363-8. 
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AN ASYMPTOTIC FORMULA IN THE 
THEORY OF PARTITIONS 


By G. SZEKERES (Adelaide) 
[Received 18 January 1950] 


1. Introduction 
THE purpose of this paper is to investigate the asymptotic behaviour, 
as k > 00, n > 00, of the following functions in number theory: 
p(n, k), the number of partitions of n into exactly k positive integral 
yp 
P(n,k) = > p(n,r) 


rsk 
q(n, k), the number of partitions of m into exactly k positive unequal 
parts. 
We do not need to consider each one of these functions separately. 
It follows from the identities 


P(n,k) = p(n-+k,k) = afn+("S'}, x} (1.1) 


that once we have an asymptotic expression for, say, P(n,k), then a 
trivial change in the variables will immediately furnish an expression 
for p(n,k) or g(n,k), and conversely. Actually we shall find it more 
convenient to use a fourth arithmetical function R(N,k) defined by 


R(N,k) = P{N—}k(k+1), B} 
whose generating function can be managed more easily. From the 
asymptotic expression for R(N,k) we obtain the corresponding expres- 
sions for P(n,k), p(n, k), g(n,k) by putting 
N = n+ }k(k+1), N = n+}k(k—3), N = n—}k(k+1) 
respectively. 

The problem of this paper was considered previously by Erdés and 
Lehner,* who investigated P(n, k) in the neighbourhood of k = 4cn* log n, 
¢ = 62-1, which is the ‘normal’ number of summands in the partitions 
of n. They found that 


P(n, k)/P(n) = {1+0(1)}exp(—ce-) (1.2) 


* P. Erdés and J. Lehner, Duke Math. Journ. 8 (1941), 335-45. 


Quart. J. Math. Oxford (2), 2 (1951), 85-108 
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for k = }cn*logn+caAn?, where P(n) = P(n,n) is the number of un- 
restricted partitions of n. Using their method, Auluck, Chowla, and 
Gupta* proved 


ntp(n, k)/P(n) = {1+-0 (1)}exp(—A—ce~) 


for k = tcentlogn+caAnt. Erdés and Lehnerf also investigated P(n, k) 
for small values of k and proved that 


P(n, ), Bn, , a(n) 2 (7 3) (1.3) 


uniformly for k = o(n*). This result can be obtained easily from the 
elementary inequalities 


a" G) <ann < tile) < 204 < a(**6)- ) 


k+1 
1 (n+k—1 1 n+( )-1 
and ail )< Par <p| 2 } 


k-—1 
which follow from the recursive formula 


[n/k} 
q(n, k) = 3 q(n—rk, k—1). (1.4) 
r=1 
To prove, for example, 
1 jn—1 
a(n, ®) < a(t) 
assume induction on k: 


1 n—rk—1 1 n—r—l 1 (fn—1 
am.8) < Ey > ( k—2 )<na| k—2 )= ap) 


The first inequality for g(n, k) can be proved similarly, and the inequali- 
ties for p(n, k) and P(n,k) follow at once from (1.1). 

In the present paper I shall obtain an asymptotic formula for R(N, k) 
which holds in a much wider range than (1.3), namely for N > Ck’, 
where the constant C has the value 0-385 approximately.§ More pre- 
cisely, I shall prove the following theorem: 


* F. C. Auluck, 8. Chowla, and H. Gupta, J. Indian Math. Soc. 6 (1942), 
105-12. 

+ P. Erdés and J. Lehner, Duke Math. Journ. 8 (1941), 335-45. 

t Essentially the same proof was given by H. Gupta for p(n,k), Proc. Indian 
Acad. Sci. (A) 16 (1942), 101-2, and by F. C. Auluck for P(n,k), J. Indian 
Math. Soc. 6 (1942), 113-14. 

§ The best possible value for C is 0-25, since N > 4k(k+1). 
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THEOREM 1. Let 
U = Ut) = t+geF+seiool +... (|t] < 4) 
be defined as the inverse function of 


u iit u aiff, 
i= ( | 4x coth 4x i) = (i+ | = a) » (1.5) 


and let p = 2-598... be the radius of convergence of v = v(t) defined by 





v 


j= oj (2—42 cot 42x) dx)” 


Write « = (k+4)?/N. Then, for every m > 0, 
log R(N,k) = x2 = —log{2 sinh }u(a) nl 


+S Mla) OU") (1.6) 
uniformly for a < 
x(t) = mo _ dog a sinh 3u/(t) }4+- log’ (t)—log(27)  (1.6’) oe 
and the functions y,,(t) are analytic for |t| < p. 
Although I can prove (1.6) only for « < py < 2-598..., and hence for 
N = (k++4)?/a > 0-385k?, it seems very likely that it holds uniformly 
for every N > Ck? with C > }. The following argument suggests that 
(1.6) is valid even in the critical neighbourhood of C = } (or « = 4), 
at least for m = 1 if O(k-*) is replaced by o(1). For, let us make this 
assumption and take the function P(n,k) at k = n. We have 


a = (k+$)/N = (k+4)%{k+ pk(k+ 1}; 


hence, from (1.5) at ¢ = a, 


Po < p, where 
t 


U 





0 


u = c(k+4)(k— 4) *H{(1+0 (k-*)}, 


' and 


2=—log(2 sinh $u) = 2u(~— 7} “loge 
= 2e-Mk-+4)-(k— )H1+0 (k-9)}. 


Also, from (1.5), 


or een 1 )- 
> Reseda (;—t oth 4n) ut ((-Z)-s-} » oe 








88 G. SZEKERES 
and hence w'(x) = dekH{1+0(k-})}. 
Putting these in (1.6), (1.6’), we easily obtain 
P(n,n) = {1+0(1)}4-13-*n—lexp(2c—1nt), 

which is the asymptotic term of the Hardy-Ramanujan formula. By 
a somewhat lengthier computation we can also derive (1.2) from the 
formula if we assume its validity in the region a = 4—8/log’k, 5 > 0, 

The formula (1.6) remains valid if « > 0 as k > oo: that is, if k = o (nt), 
We then have aia 1, w(a)->1, 
and 


log R(N, k) = [2 a)/a—log(~ sinh tole «)}| log 2nVat)-+1+-0 (1) 


= k(2— 750? + soho94—...) + (K—1)log N — 2k log k—log(2z7) +-0 (1). 
Notably, if k = O(nt-), 8 > 0, then, for q(n, k), 
(k—1)log N = (k—1)log n+ (k—1)log{1—}k(k+-1)/n} 


= (k—1)log n—(fk¥/n+-ggk>/n?+ sink" /n®+...)+-0 (1). 
Hence 


g(r, k) ~ n¥fke1(k—1)!}texp{—(J8/n-+ 38h /n?-+ yk? /n?+....)} 
by Stirling’s formula. Similarly 
p(n, k), P(n, k) ~ n*-Xk\(k—1)!}exp(fh3/n—33,15/n? + zh5k?/nF — 
Finally, if k = O(n), then 
q(n, k) ~ n*¥-k!(k—1)!}1exp(— }43/n) 
and p(n, k), P(n, k) ~ n*-lk\(k—1)!} exp(4k3/n). 


Generally, it follows from (1.6) that p(n,k) and P(n,k) are asymptoti- 
cally equal if k = o(n*). This was proved by Erdés and Lehner for 
k = o(n+) and conjectured for k = o(n?). 


2. A conjecture of Auluck, Chowla, and Gupta 
For a fixed n, p(n, k) has a maximum at a certain k = k,(n). Erdés 
proved* that 


ky = 3en* logn+clogent+-o(n*t) (c = 64/7). (2.1) 
He also stated that, if ¢(n, k) has its maximum (for a fixed n) at ko, then 
ky = (2? log 2)ent-+-dnt--o (n*) (2.2) 


for a certain constant d which he did not specify. Auluck, Chowla, and 


* P. Erdés, Bull. American Math. Soc. 52 (1946), 185-8. 
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Gupta conjectured* that p(n, k) is a monotonous function of k: that is, 
p(n, ky) < p(n, ky) ifk, < ky < kyand p(n, k,) > p(n, k,)ifky < ky < ky. 
I shall now show that the analogous conjecture for q(n, k) is true if n is 
sufficiently large and 0-635k? < n.f 
For a fixed n, 
a = oy = (k+4){n+ oe—T(k+-4)3}7 
is a function of k. We have 
Aoy, = o%,—O, 1 = $8(8 +4)k1+ O(k-), (2.3) 
where B = (n+ 4—}6*)"1. 
Also, if ¢(t) is analytic at t = B, then 
AG(a%j,) = $(%,)—G(oR-1) = $'(B)Acy,+ Of(Aa,)>} 
= B(B+-4)d'(B)/2k+ O(k-*) 


and (a,.)+4(a,_,) = 246(8)+ O(k-*). 
From (1.6) and (1.6’) we have, if 
a= N-\(k+4)? = {n—}k(kK+1)}(k+9)? < py < 2°598..., n > 0-635K?, 


log q(n, k) = (k+-4)[ 2u(c;,)/ox,—log{2 sinh $2(a;.)}]+- 
+4 log{ww’ (c;,)}+-log «,—log(k+4)?—log(2) +k, (a;,,)+ O(k-*), 
log q(n, k)—log q(n, kK—1) 


= = k(2 : u’_ ot —— coth du) ay +27 —log(2 sinh $u)+ 





B P 
L/w’. u” 1 2. §. = 
+5 (T+ ae tg bon p+ plan + OC) 
= Bp+8)(5— -eoth du) 25 —log(2sinh $u)-+ 
_ [u B(pB+4) B+4 2 o 
+(5 +5)" 4k + 2k “er ) 


where wu, u’, u” are to be taken at the place B. If we introduce the 
expression (1.7) for wu’, we obtain 


Alog q(n, k) = —log(e"— 1). BB+4) eS (" 


" 
- a +24 ok). (2.4) 
e  & 2k 

* F. C. Auluck, 8. Chowla, and H. Gupta, J. Indian Math. Soc. 6 (1942), 
105-12. 

t These restrictions on n and k are not really important. The first condition 
(that n be large) can probably be discarded if we replace the O-notations through- 
out the proof of Theorem 1 by explicit constants and check the conjecture for 
smaller values of n. But, if we know that the conjecture is true for every n, k 
satisfying, say, 0-9k2 < n, then its validity for k? > n can easily be established 
by induction on n, using the recursion (1.4). 
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This cannot be zero unless 
log{e“®—1} = O(k-), 





i.e. u(B) = log 2+ O(k-), 
aa == gS | = dx = }{(clog 2)-?—1}+4 O(k-), 
0 
B = {(2c?log?2)-1—}} ++ O(k), 
log 2 
since | poy de = Mdn*—log*2) = 4(c-*—log*2).* 


0 
Note that {(2c?log?2)-1—}}-! < 0-7, and hence £ is well within the 
region of validity of (1.6). 
From (1.7) we have 
u’ u” $ }-* 2 te“ 
ae —(t) = —4t) —-—___ 1 — g 
+50 = (20-w—s) 1457) - ape] 


e“—] 





Hence for ¢ = f, 
wu’ = log 2{2B(1—38)}*+O(k), 
un” 


—+—; = (2—§8)-Y{3—(1— $6) Hog 2}+ O(k-). 
Therefore the right-hand side of (2.4) cannot be zero unless 


wp) = log(2-+-7) +00) 








with 
- as’ gg” ee alog 2 
b = 388+ 4)(— +5) +H Oe) = ee 
a = (a/log 2)?. (2.5) 
If u(B) < log(2+6/k), the right-hand side of (2.4) is certainly positive 
unless u(B) = log(2+-b/k)+O(k-), 
i.e. log 2+6/2k 
Oe b\-2 x 2 
B+ = 3+(log2+>, re heh al 


= a—}—b(a—1)(klog 2)-14+ O(k-*). (2.6) 
Similarly (2.4) is negative for u(8) > log(2+6/k) unless (2.6) holds. 


* See D. Bierens de Haan, Nouvelles Tables d’Intégrales définies (New York, 
1939), 151, formula (104, 5). 
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Suppose now that f satisfies (2.6) and let k, be defined by 


B = (ko+3)*{n+ te—Hko +4). (2.7) 
Then, if k, < ky and a, = ki(n+ 4,—}k3)-, we have 
(ay) * > ky *(m+ Ye —4KG) > (1+1/ko)/B, 
which shows that (a,)-1 does not satisfy (2.6) if k, is large. Hence 
Alogq(n,k,) > 0. Similarly we can show that, if k, >k,+1, then 
Alogg(n,k,.) <0. This proves the conjecture for large values of 
n > 0°635k?. 
From (2.6) and (2.7) we obtain for the maximum k, 
(m+ ge )(ko +4) ?°—t = a—}—b(a—1){(ko+ d)log 2}-*+ O(ko*), 
n = a(ky+})?—b(a—1)(ko+4)/log 2+-O(1), 


= a-int-+0(1 —7) [2108 2—43+ O(n-*) 
a 
— 0-764304...nt—0-008428...+O0(n-#). (2.8) 


This improves the result of Erdés and shows that the constant d in 
(2.2) is zero. The error term in (2.8) tends to zero. Hence the position 
of the maximum is exactly known for large values of n. 

It is interesting to compare the value of k, calculated from (2.8) with 
data obtained from a table of Todd’s* containing all the values of 
p(n, k) for n < 100. We obtain for n = 100, calculated from the table 
by means of g(n,k) = o{n—(3}, tl, 

q(100, 6) = 65827, q(100, 7) = 108869, 
q(100, 8) = 116263,  q(100, 9) = 79403, 


whereas (2.8) gives ky ~ 7-63. But even for small values of n the formula 
gives the correct value of k,. For n = 16, 
q(16,2)=7,  9(16,3)=14, (16,4) =9, 
whereas (2.8) gives ky ~ 3-05. 
Another result which follows easily from my formula is an asymptotic 
expression for Q(n), the total number of partitions of m into unequal 
parts. The function 


Q(n, k) = 2 a(n,7) 
is not connected in a simple manner with R(N,k); we therefore have 
Q(n) = p3 q(n, k). (2.9) 


* J. A. Todd, Proc. London Math. Soc. (2) 48 (1945), 229-42. 


to use the formula 
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Let us first consider those terms which belong to k’s in the neighbour. 


hood of ky. 
Write u(a,) = {1-+-A(k)}log 2, A = A(k) = O(n-+), and put 


a = };(7/log 2)? = 1-71.... 
By an easy calculation 


1 1 3 x 
a Xp r a5 
= a—4—2(a—1)A+(3a—3—log 2)A?+ O(A’), 
++ 
b+4 = (n+) +3) 
a 





= (n+ ytof1+(1— “A+ 35 = g—3+e82)-+000), 
(x)/a—log{2 sinh 4u(«)} 
= 2 log 2{a—(a—1)A+ (a—1—}4 log 2)A?+ O(A)}, 
(b-+4) “2 —log(2 sinh 4n)) a to(nt ie) - (1 -7) #+000)], 
{uu'(x)}# = (a—}){2(a—1)} + log 2+ O(A), 
- (n-+-4)(1-Z) 1+ 00}. (2.10) 


Hence 
1 P U , 
a(n, k) = =~ (uu )-toxp{(b-+4)(2" —log(2.sinh 4u))} {1-+- 0-9} 


1 


{2(a—1)}- fexp(3 -tmni(1— (1 —2)x*)]ta +00), 


The condition A = O(n-+) is certainly satisfied if k, < k < k,, where 
1 = a-int—n*, k, = a-*nt+-n*. Therefore 


neu! a(n, me - oe MS 


= mee 





=as5 





{2(a—1)}-#exp(3-#2n#) x 


x > exp|— tant (1-7) ack | 
188 ea 





= {1+o(1 (2(a tant x 


A(k2) 
x | exp|— 3 “tern (1-2) da, 
Ake) 








bo 





It- 


re 
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ne 620 AK = (b+-2)—(6-2) = (1-+0(1)}a-4(1—2)n dd, 


by (2.10). Writing 
1 
y= [bent (1 — 7) 


and, noting that, for k = k, and k = ky, |A(k)| > C,n- for a certain 
positive constant C,, and hence that n#{A(k;)}* > 00 (¢ = 1, 2), we obtain 


q(n, k) = {1+0(1)}4at log 27-In-?3? | e-* dy exp(3-tnn') 
ee g P 
= {1+0(1)}4-13-*n-? exp(3-*znt). 


Now it is easily seen that the terms with k < k, and k > k, in (2.9) do 
not contribute essentially to the value of Q(n), for n*A*(k;) > C? n° 
where 7 = 1, 2. Hence 
(1 — Jaa] < exp(—C, n¥1°) 

a 


1 
exp|—3-+nnt = 


for a positive constant C,, and 
q(n,k;) < q(n, kp)exp(—C, n™°), 
2 Me B+ & Um, k) < akq(n, ky)+a(n, ka)} = 0 g(r, ko)}, 
which implies Q(n) ~ 4-13-4n-texp(3-tnb). 


This is the asymptotic term of the Hardy—Ramanujan formula for Q(n). 
The above proof is entirely independent of the theorem of residues and 
the transformation properties of elliptic modular functions, which seem 
to be indispensable for the proof of the exact formula of Hardy-— 
Ramanujan—Hua.* 

For g(n, ky) = max q(n, k) we obtain 


—t 
q(n, ky) = io-s(1—) n—exp(3-txn'). 


Hence Q(n)/n*q(n, ky) > 3#2#(1—1/a)* = 1-200.... 


Except for the explicit value of the constant on the right, this result 
was also found by Erdés.t 
If we assume the validity of (1.6), (1.6’) at a = 4—8/log*k (8 > 0) 


* L. K. Hua, Trans. American Math. Soc. 51 (1942), 194-201. There is a 
misprint in the main formula on p. 195 of the paper. #(n+;) should read }(n +34). 
t P. Erdés, Bull. American Math. Soc. 52 (1946), 185-8. 
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and do not care about the error terms, then also (2.1) can be obtained 
from the formula. It seems likely that 

ky = 4en* log n+clog cnt+-d-+-o (1) ~ 0-38985...nt log n—0-19403n?+d 
for a certain constant d. Data taken from Todd’s table suggests that 
the value of d must be near to 1-7. 


3. Proof of the main theorem ’ 

The proof of Theorem 1 is based on two results. One is the classical 
theory of ‘waves’ of Sylvester.* Let ¢ denote a primitive qth root of 
unity (¢ < k) and write P;(n,k) for the coefficient of ¢-* in 


fit) = Ce" Ty (1—Be-t), (3.1) 
j=1 
Then P(n,k)= > 5 Pm, k) = Bln, b)+ s & Pi(n,b), (3.2) 
q=1 q=2 


where the summation is to be taken for every primitive qth root of unity 

(q <k). This remarkable formula splits P(n, k) into two parts: a prin- 

cipal term P,(n,k) which governs the asymptotic behaviour of P(n, k), 

and a circulatory part 2 P,(n,k) which depends on the arithmetical 
qa 


properties of n. 

It may seem surprising that the Sylvester formula has never been 
used for the purposes of an asymptotic evaluation. Hardy and Rama- 
nujan, in their famous paper on partitions,t came to a rather unfavour- 
able conclusion with regard to this possibility. I quote from their 
footnote on p. 76: “These (cf. Sylvester’s) formulae do, of course, furnish 
incidentally asymptotic formulae for the functions in question. But 
they are, from this point of view, of a very trivial character: the interest 
which they possess is algebraical.’ The asymptotic formulae to which 
Hardy and Ramanujan refer here are those which hold for fixed k when 
n tends to infinity: p(n, k) ~ n*-1/k! (k—1)!, which incidentally follows 
easily from the elementary formula (1.3). But there is really no need 
to confine ourselves to that trivial situation (when k is fixed), since 
Sylvester’s theorem expresses an identity which holds for every n and k. 
As a matter of fact, the Hardy-Ramanujan formula itself is composed 
of expressions which strongly suggest a close relationship to the corre- 
sponding Sylvester waves for P(n,n). 


* See e.g. L. E. Dickson, History of the theory of numbers (Washington, 1920), 
2, 119. 

+ G. H. Hardy and S. Ramanujan, Proc. London Math. Soc. (2) 17 (1918), 
75-115. 
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The other theorem on which my proof is based is the following 
asymptotic expansion which I obtained quite recently* for the co- 
efficients of certain power series. 


= lL. 
THEOREM 2. Let man 
Vv 


be @ given power series with positive radius of convergence and constant 
term zero, h an integer, « a complex parameter, o, 7 real constants , 
(09<oa< 1). Write 
Gl2) = eRie-te0) — > wi Adtoade, = “(h-+7). (3.3) 
p=0 
Let u = u(t) = u(o,t) denote the inverse function of 


U ca) 
~ ouf(u) - 5 3.4 
+p ot (3.4) 
and let v = v(o,t) be defined by 
v= t(o+ > |c, |e”). 
v=1 
Then 
log A;, ; = log A, (A, «) = (h—l1)log K—K re ot dt-+-r lo g tla) 
on 


0 





Shy (a)-+O(- ™) (3.5) 


for every m > 0 and certain funciions rahe . ear: for |x| < p, where 
p denotes the radius of convergence of v(t). The expansion (3.5) is uniformly 
valid for |x| < po < p. . 


THEOREM 3. Suppose that the coefficients of f(z) are not constants but 
have an asymptotic expansion of the form 


jinn maf > ea k-44.8, B (“ohn JE am(14 3 ‘) | (3.6) 


where C,, E,, a ( >) are constants (independent of k) and |8,| < 1. 
Then Theor em 2 2 still holds if we form u(t) with coefficients C,, independent 


of k viz. 4 Ho < 0,w). 


v=1 


* G. Szekeres, ‘The asymptotic behaviour of the coefficients of certain power 
series’, Acta Sci. Math. Szeged, 12 (1950), 187-98, Theorems 1, 2. 
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CoroLuaRY. If the coefficients of f(z) have the form 
ce, = Of14+6,(v+1)K-} (|8,| <1, C, constant), 





and “= i(o+ ) C,w’), 
v=1 
then log A, = (— eg K-K | a dt+-0(1). (3.1) 


0 
I shall obtain the proof of Theorem 1 by applying the asymptotic 
expansions of Theorems 2, 3 to the coefficients of f(¢, t) in (3.1). Write 
RN, k) = PAN—}k(k+1), &}, 
so that R;,(N,k) is the coefficient of t-1 in 


f(l,t) = C-NHMe+DeNt Il (ett Lie-tity-1, (3.8) 
Sylvester’s theorem states that si 
R(N,k) = RN, H+ R,(N,k). (3.9) 
4. The principal term F,(N, k) . 


I shall prove in this section that R,(N,k) satisfies the asymptotic 
expansion (1.6), (1.6’). From (3.8), 





k 
f(1,t) = aot T] 4vt(sinh 4vt)-1. (4.1) 
Hence, writing 7 = ” 
g(1,t) = exp| Nt— 2 log{(4vt)— sinh bt}] = > =6 1K k)t”, (4.2) } 
ye ace | v! 
we have R,(N,k) = {k!\(k—1)!}-6,_,(N, &). (4.3) 
From (4.2), i> 
g(1,t) = exp[Ne— ees Ae “pres san | (4.4) 
. B, 
= = exp[Nt— a (>+ Noa we rad 


where B,, denotes the pth Bernoulli number (By = 1, B, = 3, B, = y;...). 


k 
To evaluate > v*?, we use a modified form of the Euler-Bernoulli 


v=1 
formula. Let the numbers D,, be defined by 
: el 
D=1, Dm tou 3,%...) 4.5) 
; ps ran = 0 a 
[See Norlund,* where it is shown that D,, = (—1)*(2%—2)B,.] 


* N. E. Nérlund, Differenzenrechnung (Berlin, 1924), 27-8. 
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Then, for m > 0, p > 0, 


—_ > mo + ‘| #D, yw(k-+- 4) +- *tISB, (Ey eet ute-tnet 
2 2m 


(j8] <1). (4.6) 
The formula is a direct consequence of a formula given by Nérlund 
floc. cit. 31]. 
Combining (4.6) and (4.4), we obtain 


g(1,t) = exp(Nt— 


m—1 2 1 
ee esol tengo 


or aiitiade introducing 
N 1 
c= t= A(k+ bt, a = (+4), 


‘i 
exp[+H)[2— > Se(aay]] 


p=1 


= ae ~ Ayla, k)2”, (4.7) 


& 
- 
A. 3 

iL 

a 
tol es 

— 
+ 
toe 
— 
x 

— —— 

ll 

- 





m—1 ‘ 
Cap = (—WP P| DBD (EE (eb 


I9n-+1 1 2p—2m+1 
+Bn8y( Pr (+325) atom], (4.8) 





and A,(a,k) = b,(N, wy) = = b,(N, mle J. (4.9) 


k+4 
We apply Theorem 3 to G(1,z), writing in (3.6) 
o= 1, ¢ = i, h=k, 


B, Crp-1 - 0, £,= = 2-%f— 


Cop sis (—1)-* 2p? 


@= |, 





(2p+1)!” 


3695 .2.2 H 
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We have 


- {+ > Fr) 7 (Hee i I ae) 
0 








p=1 
[ (uct) oe? at = 229) _ Jog a-tlog{? sinh }u(a)}, 
0 
by (1.7). Hence Theorem 3 gives 
log Ay_4(«,k) = (k—1)log(k-+-4)— 
—(k-+4) 2-2") _ og a-+-log{2 sinh du(a)}} + 


u(x) 


+3 log—— 
that is, by (4.9), 
log b,_,(N, k) = (k—1)log(k+-4) ee 1(«, k) 


= 2(k—I)log(k+4)-+(E+4){2 2) —logt2 sinh du(a)}} — 


O4 Hlogu'(a)-+"S dy(a)k-A+-O(k-™): (4.10) 


m1 
— 2(k+-4)+log «+4 log{uw’ («)}+ 'S $,(a)k-# + O(k-™), 
p=1 
Stirling’s formula implies 
log{k!(k—1)!} 


= 2klog k—2k+log(27)+ > ae 


Hence, by (4.3), 
log R,(N,k) = ore (NV, k)—log{k!(k—1)}} 


= b+p)[2%) —log{2 sinh }u(a a)}| +-2bog(1 + 57} —1—2log(i-+4)+ 
+log «a—log(27)+-4 log{uw’(«)}+- 





hk tat O(k ;—2m+1), 


[m/2] B — 
inn ae v —2v+1 = —m 





= &| 2) —logte sinh $u(«) | —log v4 4) — lo la du(a)) + 


+ #og.’(a)—log(2n)-+"S k-"Y,(a)-+O(k-™). 


This proves the statement. The range of validity of the expression is 
given by the radius of convergence of v(t) defined by 


t= ofl +> a i wwe) = (| (2—4z cot 42) az) 
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Since the coefficients B,/(2p+-1)! are positive, 


: 


on the circle |v| = r. Hence the minimum of |é(v)| on |v| = r is 


(2—4zcot 4z) dz| < | (2—42 cot $x) dx 








ot 


0 


M(r) = n( { (2—4 cot 42) ax) 
0 


It follows from the Biirmann—Lagrange theorem* that v(t) is certainly 
convergent in a circle with radius M(r). The maximum p = M(R) of 


M(r) is where 
R 


4R(2—4R cot $R) = | (2—4xcot 42) de, 
0 
ie. at R = 4-2048.... Hence 


2 


PS. “8 
>= re | (2—42 cot $2) as) = R(1—} Root 4R)- = 2-5984.... 
0 


5. The circulatory part 

I have to show now that the circulatory part has a smaller order of 
magnitude than the principal term R,(N,k). This corresponds to the 
fact (which, however, I shall not use) that z = 1 is the ‘strongest’ 
singularity of the generating function 


{(1—2)(1—22)...(1—2*)} 1 = 1+ > P(n, ken, 


In fact, I shall prove that 
k 
DE RAN, b = of RW, B} 
for every m > 0. 
We first consider the dominating term R_,(N,k), which can be treated 
very similarly to R,(N,k). We have from (3.8) 


f(—1, 8) = (—1)-N HMR Q-K (Hh!) “Tet II {cosh(v—4)4}—2 Il vi(sinh vt)-1, 
v=1 v=1 


h=[3k], hh’ =[3k+4], 
and R_,(N,k) = (—1)-¥+4*+02-*{h1(h—1)}18,_,(—1,. Nk), (5.1) 


* See Hurwitz—Courant, Funktionentheorie, 2. Aufl. (Berlin, 1925), 139. 
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where 
co 


1 
g(—1,2) = > o(—-LN, kt” 


= 
i] 
) 





vt 


h’ @ 
(2°—1)B, = 
“i > > Ziel er aa! [(2»—1)¢]*? 


<< B 
722,07" we oy | 


(227—1)B, i; 
2p(2p)! 


- 5 [Serratia 


ae 
= exp(NE—H+b > (—1P4 ox 


x [1+38,(2p-+Dk*]2(k-+4)}"} (8 <). 


The Euler formula is used here in the form 
(ak+4) [sk] 1 
Sop, Sve app Get Herr + Se + Hy? ((8] <I). 


We change the variable ¢ into 


— 2N bas h+1, 


k+4 a 
and apply the corollary of Theorem 3 to 


exp i+) (2— > (1p, 


We obtain as in the previous section that 


h-1 
log{nx(—1.¥,B(ag%5) | = @—Wlogte+D}-HE+D x 


~~ Sot 
y=1 


I 
@ 

4 
i 
» % 
= 


= 
i] 
- 
8 
ll 
e 





3 (2—1)%"|(—1 





{148,(2p-+1)k (a2) 


x{2— 2 — MO) + toa(= sinh $u(a «)}} +00), 
log b,_,(—1, N, k) = 2(h—1)log(k+-4)—(h—1)log a— 
—h2— 2MO) + Nog(- sinh $u(a «))} +00). 








Si 


g 
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Hence, by (5.1) and Stirling’s formula, 
log{(— 1)N-*#k+DR_(N, k)} 
= —klog 2—log{h!(h—1)!}+log b,_,(—1, N, &) 


= n{2% i —log(2 sinh 4u(« ))} -Iog V+-0(1) 


Since 
Mo) _ logte sinh 4u(a)} > 2%) _ 44a) 
- 2u(a)(2—2) - s|eai® >0 
a wu 7] 


uje 
0 

and h = 4k, it follows that R_,(N,k) has a smaller exponential order 

than R,(N,k) and that R_,(N,k) = o{k-"R,(N,k)} for every m > 0. 


6. The circulatory part (continued) 
Suppose now that q > 2 and write 


—— |. k=ghtr (0<r<q), 


a(r,¢) = (1—f)(1—2?)...(1—- 0") if O<r <q, 7(0,f) = 1. (6.1) 
Then 


n(q—1,£) = (1—)...(1— 23) = lim = ¢, 


—l 
and, from (3.8), 
f(E,t) = C-NMR Er(r, L)q?*hl}—-M-PeM x 
s h 
x T]’ (1—&)(e#— Ge) TT dqvt(sinh 4qrt)-. 
y= v=1 


Here II’ signifies that the product index j takes only those values 
which are not multiples of g. A similar remark applies to x’ below. 
Write 


= k 
(t,t) = > ie 1AiCN, ker = exp(Nt— ¥’ log[(1—2)-4( eb" Ley] — 
p= w 


7 


h 
— 2 log[(da-t)~* sinh jqrt]}. (6.2) 
Then 
RN, k) = 
Now, if 1 = [4q], 
|a(r,£)| = \(1—Q)...(1—2")| > |(1—e2*#/a)(1 —e4milay, (1 —e%7ila) | 
1 


I 
> a= (jusew. 
tq \q 


L-N+IMk+Df9r(r, L)q?*h!(h—1)!}-1,_4(L, NV, bh). 
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Hence |Ry(N,k)| < q-**{h!(h—1)!}-*e#9|b,_1 (2, N, k)]. (6.3) 


To obtain an estimate for b,_, we have to study the function in the 
exponent of (6.2). Let us write, for B + 1, 





$(8,t) = —log® = Be “2 (B, p)tH (6.4) 
We have 
_ B+l1 


BB, +2) = (B—1)* P(Lyo + Ly B+... +Ly,BY)B (wu > 0), (6.5) 
where the numbers L,, are positive integers satisfying the recursion 


Ly says = (ut1—v)L,,+0+2) Li 4 (u > v > 0), 


Leo = Lyp = Ly = 1. | (6.6) 
— 
For $(B,t) = ae 


and #428, t) = (B—e/)-»-2Bet SL, Pre" (uw > 0), 
v=0 


as is easily verified by induction. 
Writing ¢ = (B—1)# and making B > 1 in (6.4) and (6.5), we obtain, 


— lim(1—f)-1{e##8-)_ Be-48-D} — 12, 
poi 
ote tl, ae 
Gray net pat ted = 
Le. Lygt Lyyt+-.+ Lay = (ut+))!. 


Hence for an arbitrary primitive gth root of unity { (q > 3) 


BOW) < Nimmo < (5). (6.0 


Next I show that 


Ss BY, 2v—1) = 0, Ss B(fi,2v) = (— 1) $2 (@"—1) (y == I, 3....). 
a . (6.8) 


Using the identity 


Te-1) = 1), 
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we obtain 


= it <. dé 
>, 8 log = —}}(q—1)t+ ie log rz 








o-t 4 og @t__} 
ie ies CF ne -tntnsten tn 
iq iye+log | | Tap = ~ Hatt log 
ome B, 2v 2v 
= log (= sinh ia!) le (7 sinh it) = — BQ) 1 (7 1)”. 
Then (6.8) follows hen this and (6.4). 
Finally we note the following formula: 
Let 0<j<g,h'= mee then 
y 
+3)? (k+-4)?+1+-8(k §| <1). 6.9 
Dorr = petra tar (i81<1). (69) 


This follows from Euler’s formula or simply by comparing 


h’ \ p (1/ak+4) 
y) , 

> (>+ ) with | xP dz. 
q 


v=0 0 


Now we can develop the function in the exponent of (6.2) into a 
power series. For the first sum we have 


_ b log{( (1—Z)- 1(etst— fie—tit)} 


k k Ce) 1 
= SY’ d(f, jt) = — , by (6.4), 
>’ HG, jt) meas p)jtp, by (6.4) 








2] ~ ; 
= Dav Be ?) > (qv+j)” 
p=1 j=1 v=0 
co 1 q-1 ’ 
— ugh BG k+4)P+1+8(k Pp by (6.9), 
23 2, (fi, Po py et betta +a}, y (6.9) 


_ k+4 . mn a B, Pane v 
| = ae (—1) De HI I){(k+4)3"+ 


0 Sls, sigik-+a) by (6.7), (6.8). 


ae! 
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For the second sum we have 


— > log{($qvt)—1 sinh 4qvt} 


h hesd B, ‘ os) B ‘ h 
=s —])? _—_” _ (qvt)*? — — p v)2 
>, D>, Wage on 2 | pep 2, 








v=lp=1 
k+ 
1 FD, OP pane + > 8p 5 ae 
_Ftt S B, 
ti Ate area b+ 4)t}27-4 > “e{ Sgt)” 
cee B,/|( 2p)! (q/3v3)?”. 


Combining these results with (6.2), we have 


Mest) = YS blGsN Re 


Be exp( eS #4 (4 =P _[a(k-+4) et}? + 


at 


2p(2p+1)! 
Pe (sFge+o) iy ? 
Changing ¢ into 
Nq,_k+4 
oe a 


and noting that 


a < p= 2-5984... << 3V3(k+4) * a(k-+-q) 


> ate Ss 1, 
k+q 3V3(k+-4) ~ 


we obtain 


D,silgery) GN be = Oackennle Se areeen) | 
= Gq, sesrla > | 


= (9,2) S ii 


p= 


* As a matter of fact, p = 2-5984... is slightly bigger than 43V3 and the above 
inequality is not true ifh = 1 and gis very near tok. But, ifqg > 4, then obviously 
33 can be replaced by 4¥V2 in (6.7), and the inequality becomes a < 2V2 = 2°8.... 
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where 


G(q, 2) = exp{7 (k+-4){z \(e— oO 


— 1 
oe 2p ho v 
pp pip) }} = La” A,#. 
(6.10) 
Hence 
1 
(h—1)! 
Let us put K = (k+-4)/q and apply Theorem 2 to the coefficients 
Aj; l=oK (¢< 1). 


(te). 4, (6.11) 
Vv: 


bn-alb 4) < (Ze+a)} : 


pt+v=h-1 


We have, for wu = u(oa, t), 


© | 1 by -1 
ra ot p—1 u2 = re Be 
= a )z ai w| ufo 144 | doth jas) 
si 0 
(6.12) 





and, from (3.5), 


log A,_, = (log K—K | MA" ae+-041) 
0 


u(o, x) _9 u(a, «) 
ou a 


sinh }u(o, a) +00), (6.13) 


= 





os (oK—1)log K—K{20-+elog 





+loe(= 2) 


since, by (6.12), ~ =+* —~—}eoth in) = == wa, 
a\a 


Stirling’s formula gives 


1 
log yi Ara] = log A,_,—(I—4)log/-+1+-0(1) 








= k{2"0) _ 4 —clog“(®: ~) —og| = 


a 


= ee su(a, «)|}— 


—log K+4logl+O(1). (6.14) 


The expression in the brackets {} has a maximum (at fixed a) if 


ks [2m o—alog 1%) tog sinh 4u(o, >} a= 0, 
a 


do | oY 





2 
u(a, x) 
1 


Fel; tga booth an = 1+log”. 
u ou 


dala 
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Here the left-hand side is 1, as is easily seen from (6.12). Hence (6.14) 
is maximum if u(o,«) = «, i.e., by (6.12), if 
~ B 
= |j— =, es a2P = , aa 
o=] 2 § 1) @pi! ~ | decoth de de. 
0 
For this value of o, the expression in (6.14) becomes 


a 


2— o—log(= sinh in) = if 4x coth $x da—log{ sinh i), 
a 
and 


log | ni4e a} <a! j te coth Je de —log(” sinh js) 


Combining this with (6.11) and (6.3), and noting that 


i (eed) <h—l 


qtu—l\ _ (ath) _ (+h)! 
- ( y )< (5) = Sir 








in (6.11), we obtain 
log Ry(W, k)| < —2hlogq—(h-+- Plogh-+-h+-Ig-+(h— log 2(k-+-4)} + 
+(q+A)log(q+h)—q log q—h log h+ 


+h (2 j 4x coth $x de—log( sinh is 


= af +- | 4x coth $a dx—log(2 sinh is —log N+-Alog(1+-¢/h)+ 
0 


+qlog(1+-h/q)+3q+ O(log h). 


Here hlog(1+q/h) = O(k*), qlog(1+h/q) = O(k) 
since A =] 

gh} 
Hence 


log|R,(N,k)| < af +2 | 4x coth $a dx—log(2 sinh is) _ 
0 


—log N+ 49+ 0(#) 


< wef +5 | $x coth $x dx—log(2 sinh 10)+4| st 


—log N+ O(K*). 








H 


Fo 
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In § 4 we have proved that 


log R,(N,k) = x2 29) _ togi sinh Ju(a)]} log W-+-0(1). 


Hence we have finished the proof if we can show that 


e+ | 4x coth 42 dx—log(2 sinh 4a) < 32% 2 Mla) —log| 2 sinh $u(« y} 


= 3 (2 | 4a coth 42 dx—log{ 2 sinh iw(a]| - (6.15) 
lw J 


Now, if w > 0, then 
Uu 
(i) . [ 4x coth 4x dx is monotone increasing, 
w J 
u 


(ii) [ 4x coth 4x dx —log( = sinh yn) is monotone decreasing, 


ty 


0 


uU 


| 4x coth 4a dx— log(= sinh yu) is monotone increasing with w. 


Sl ro 


(iii) 


Se 


For, 
uu 


(i) mar { 4x coth 4x tr) = = nite coth 4u— — ~{ 4x coth 4x i) >0 


0 0 


if u > 0, 


i) — ar] Jeeoth tx de—log(= sinh 4) 


du \2 


a 
~ u 


2(1—3 f scot ax <0 ifu>0, 


0 


(iii) ) 2 i 42x coth 4x dx— log(= sinh is) 


du 


= 2 (i coth 4u-+-u—2 | 4x coth 4x ax) 
0 
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and the last expression is positive for u > 0, since 
u 
s (4 coth ju-+-s—8 | seooth fe as) = 1—}u*(cosech 4u)* > 0. 
0 


Hence oe 
+2 { $x coth $a dx— log(= sinh is) <3 by (ii), 
a 


0 


2{2 f gecotne d—tog(=sinh 4) >6_ by (iii), 
0 


ve 


and (6.15) is true if 
§—loga < 6—3log u(a), 3 log u(a)—loga < 3. 


Now ma) _ i 4x coth $x dx 


is increasing with wu (and a) by (i. Hence 3 log u(«)—log « is monotone 
increasing with a. But, if w(«) = 3-5, then 


35 «ip 
an ao | $a coth $a az] = 2°68... > p, 
0 


3 log u(«)—loga < 3log 3-5—log 2-68 < 3, 
which proves (6.15). 


[Note added 14 July 1950.] 
From this point of view it is interesting to compare by a numerical example 


the Sylvester terms for P(n,) and the Hardy-Ramanujan terms for P(n). For 
q <n, write P(n) = > P,(n,n), where { runs through the primitive qth roots 
of unity. Taking n = 8 we obtain 
P,(8) = 21-4127, P,(8) = 0-4112, P,(8) = —0-0566, P,(8) = 0-1016, 
P,(8) = 0, P,(8) = 0-0278, P,(8) = 0-0408, P,(8) = 0-0625, 
whereas the first five terms of the modified exact Hardy—Ramanujan—Rade- 
macher series are 
21-7092 + 0-3463 — 0-0896 + 0-0500—0-0192-+.... 
Of course, the sum of both series is P(8) = 22. 

For large values of n, I suspect a very close approximation of the P,(n) by 
the corresponding Hardy—Ramanujan—Rademacher terms. Unfortunately, it is 
not easy to obtain data for comparison since the computation of the Sylvester 
waves becomes almost prohibitive for large values of n. In one instance I have 
calculated P,(40) and found + 7-4784, whereas the second Rademacher term for 
P(40) is +7-4311. In the computation I was kindly assisted by the Mathe- 
matical Statistics section of the Commonwealth Scientific and Industrial Re- 
search Organization. 
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OVERCONVERGENCE PROPERTIES OF SOME 
INTERPOLATION SERIES 
By SHEILA SCOTT MACINTYRE (Aberdeen) 
[Received 3 February 1950] 


I. LipsToNE SERIES 
1. Classical results connected with the Lidstone expansion 


SLs ada) +F2)8:)] (1) 


are embodied in Theorem I [(6) 287 and (8) 455] and Theorem IT 
[(6) 292 and (8) 455] below. M(r) is the maximum modulus of f(z) on 
\z|} = r. By a derivative of order 0 is understood, as usual, the function 
itself, and «,(z), 8,(z) are the Lidstone polynomials defined by the 


formulae 4"(z) = onal), BZ(2) = By alz) (n > 1) (2) 
together with 
%(z) = (z—b)/(a—d), Bo(z) = (z—a)/(b—a), (3) 
OX, (a) = p(B) _ B,,(@) = B,,(6) = 0 (n => 1). (4) 


Alternative definitions due to Poritsky and equivalent to (2), (3), (4) are 
contained in (7), (8), (9). 
THEOREM I. Jf f(z) is an integral function satisfying 
fe) = Oe!) (0< k < a/|a—6)), (5) 
then the Lidstone series (1) converges to f(z) for all z, the convergence being 
uniform in any bounded region. 


THEOREM II. [Jf the series 
2,|% o,(z) +5; B,(z)] 


converges for any value of z, not congruent to a (mod(b—a)), then it con- 
verges to an integral function f(z) uniformly in every bounded region of the 
plane, and f(z) satisfies 
fP(a) = ay, f(b) = by 

It is seen from Theorem II that there can be no phenomenon of over- 
convergence in the Ostrowski sense associated with Lidstone series. 

Ostrowski (5) showed that, provided that there are gaps in the power 
series f(z) = > a, 2” satisfying certain conditions, a sequence of partial 
sums of the power series converges uniformly in a region which extends 
outside the circle of convergence of f(z). 


Quart. J. Math. Oxford (2), 2 (1951), 109-20 
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As is seen from Theorem II, the Lidstone series possesses no region 
of convergence other than the whole plane, so it is clear that any ‘over- 
convergence’ property must be of a different nature. I show that a 
sequence of partial sums of the Lidstone series may converge to f(z) in 
the whole plane or in a part of the plane although the Lidstone series 
itself does not converge for any value of z, except at points congruent 
to a (mod(b—a)). Results of this nature are given in Theorems III, IV, 
V, and functions are constructed to illustrate the results. I describe the 
Lidstone series as exhibiting ‘overconvergence’ in these circumstances. 

Before establishing these theorems, we require formulae obtained by 
Poritsky [see (6) 282-9] in the course of his proof of Theorem I: namely 


He) = ¥ [F(a)ae(2)+F°%O)B@)}+Ta(2) 


a b 
where —7,,(z) = [ f2"+%(s)A,(z, 8) ds+ [ f2"+s)B,(z,s) ds, (6) 


the integrations being carried out along any paths on which f(s) is 

analytic, and where 

0A,,(z, 8) 
os 


OB, (z, 8) 


» Bale) = 2 (7 





O, (2) - 








> 
s=a s=b 


A,,(z, 8), B,(z,s) being defined by their generating functions as follows: 
sinh A(z—b)sinh A(s—a) 
Asinh A(a—b) , 





A(z, 8,2) = 5 AQl, am (8) 


Biz, 8,d) = $B, (2,8) = —A(s,z,d). (9) 


n=0 


Moreover, if a = 0, b = a, he proves that 
A, (2,8), —B,(z,8) = (—1)"2sinzsins-+O(}-++e)™ (e > 0) 
7 


uniformly for z and s in any bounded part of their planes; and, using 
elementary transformations, we find that in the general case this 
becomes 


A, (2,8), —B,(z, 8) 


2(b—a)2"+1 ii (z—a)r oie Sar +o P oe +¢ 2n (c sy 0). 


rr b—a b—a 2a 
(10) 


Since (5) implies [f(z)| < Cnikn (11) 


=(-1) 
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if z is restricted to any bounded region [see (6) 289], Poritsky’s proof 
of Theorem I follows. Indeed, condition (5) may be sharpened to 





fle) = of z\4exp}-*5}, (5a) 
as has been shown by Boas [see (1) Theorem 2 A]. 


2. Now it follows from (8) and (9) that 


i a ne __»)2n4+1 
A(z, 8,A)-+B(z,8,A) = = z) S y2n —— 
=0 ‘ 








and hence that 
(s—z)2"+1 


na (2n+1)! ° 
From (6), provided that the integrations are carried out along paths on 
which f(z) is analytic, 








A,,(z, 8,A)+- B,(z, s, A) 


Ty(2) = | fOn4%(s)A,( 


a 


x 


b 
,8) ds+ {fe +2)(s) B. (z, 8) ds+- 


a 


+ | f2n+2(s){A,(z, 8) +B,(z, s)} ds 


z 
a b 


= | f2n+2(s)4,,(z, 8) ds+ [ f2"+%(s)B,(z, 8) ds + 


a a 


+ f f+ (9)(s—2)+4/(2n-+1)! de 


= R,,(z) +8op +2(2, a), 


where « is any point, S,(z,z9) is the remainder after p terms in the 
Taylor series for f(z) with centre z), and 


a b 
R,(z) = [ f2"+2(s)4,,(2, 8) ds+- [ £2"#2)(s) Bale, s) ds. 
Let the circle of convergence of f(z) with centre « be denoted by C,, 
and suppose that there exists a rectifiable curve I’, joining a to 6 and 
passing through «, along which f(z) is analytic and on which 
lim max | f@(z) |{a/|a—b|}-*" = 0. (12) 
ae oom 


no ™ 


It follows from (10) and (12) that R,,(z) tends to zero uniformly in any 
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bounded region as n tends to infinity through a suitable sequence. Also 
Son+9(%, x) tends to zero uniformly for z in C,, and we have 


THEoREM III. Jf f(z) is analytic and satisfies (12) on a rectifiable 
curve I’ joining a and b, then there exists a sequence of partial sums of 
the Lidstone series (1) which converges uniformly to f(z) in the interior 
of the circle of convergence for f(z) with centre at any point of T. 

We have also established 


THEOREM IV. If there exists a sequence n, such that the corresponding 
sequence of partial sums S,,, of the Taylor series for f(z), with centre any 
point a, converges uniformly to f(z) in a region D and if f(z) is analytic on 
a rectifiable curve I’, joining a, «, and b, on which it satisfies 

lim max | f@"»)(z) |{2/|a—b|}-™ = 0, 

nso oT 
then there is a sub-sequence n, such that the corresponding sequence of 
partial sums of the Lidstone series (1) converges uniformly to f(z) in D. 


3. We also have a theorem about integral functions of ‘lower type’ 
less than z/|a—6|, ie. an integral function whose maximum modulus 


satisfies 
is log Mir) oe 7 
eg |a—b| 


THEOREM V. [Jf f(z) is an integral function satisfying 


lim soa in otisiietl, 
then a sequence of partial sums of the Lidstone series (1) converges to f(z) 
for all z, the convergence being uniform in any bounded region. 


This follows from the argument in § 2 with the aid of 
Lemma I. Jf f(z) is an integral function satisfying 


La (13) 


lim 
—— r 


To 


then, if D is a bounded set, 


lim max | f@™(z)|¥2" < o. 


To prove Lemma 1, we observe that, since (13) is satisfied, then, if 1 >, 
M(r,,) < explr,, (14) 
where 7, is a sequence which tends to infinity. Now 


fre) = Pf FO _ ay 


Qri | (C—zypri 
Cc 








a —  —— ae | 


~_3= — om 








f 


18 


o; 
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where C is any contour surrounding =z. Let C be the contour 


\¢] = 7, where |z| <7,. Then 
p\M(r 


| #(p)(+)\!) 4 
Oe) < (15) 


Now, if p,, = [4/,], then, by (14) and (15), and Stirling’s approximation 








for n! 
, (2p,,)! e2Pnt? [2Pn 


| f Pn)(z) | < 


" ~ (2p,)??o{1—3]2|1/p,}??at 


a\(4rrp,, e?l??»{1-+-0 (1)} 
om {1—$|2|l/p,}?P9*4 








as n tends to infinity, and therefore 
|fPm(2) MP» < If1+0(1)} 
if z is restricted to any bounded set. Therefore 
lim max| f 2™)(z) [120 < l, 
n—>2o 
provided that z belongs to any given bounded set. But / is any number 
greater than o. The result follows. 


4. Functions exist which are not integral functions of the kind con- 
sidered in Theorem V, but which satisfy the conditions of Theorem ITI. 
For example, taking a = « = 0, b = jn, T the section of the real axis 


pat J 
0 <a < }r, let d(z) = 2-14 22s-1y (16) 


where A,, is an increasing sequence of positive integers. The power series 
(16) has unit radius of convergence and therefore has a singularity at 
one point (at least) on the unit circle and cannot therefore represent an 
integral function. The Lidstone series for ¢(z) cannot therefore converge 
to ¢(z) in any domain D, as in that case (by Theorem IT) ¢(z) would 
coincide with an integral function in D and therefore in the whole plane. 
We can, however, show that it is possible, provided that x lies on IT, to 
select the sequence A,, in such a way that 

max |g®(x)| <1 (17) 

O0<2x<ji7r 
and thus, since 7/|a—b| = 4, (12) is satisfied for the sequence A,, with 
f(z) = d(z), and this function therefore satisfies the conditions of 
Theorem ITI. 
For, with any integer A,, and any x on I, 


o 


a “;) = > r(r—1)...(r—2A, + 1)ar—™s, (18) 


dx?>n j— x r=2A, 


‘ 
3695 .2.2 I 
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and this is a convergent series; thus, if we write 
ma = % 4 a’ + Ry(x), 
l—x =o ‘ 
the remainder of the series (18) after N—2A, terms, namely 
d*» Ry(x)/da™, 
can be made as small as we please if N is sufficiently large; in fact 


dn Ry (x ) 
dx?An 


provided that VN > N,(A,,). But, if we choose d,,,, so that 
N= 2An+—1 Z Np 


<1 


d* R(x) 
) N 

then 0 < d(x) < a’ 
and thus 0 < p(x) <1, 


and (17) is satisfied. 

We can construct a function to illustrate Theorem IV by a similar 
treatment of a (suitably chosen) series exhibiting overconvergence in 
the Ostrowski sense. For example, consider 


We) = ¥ fe(1-+2)}""/Py (16a) 


where yz, is an increasing sequence of positive integers satisfying 


Pn > 2pp»t+2, 
and p,, is the maximal coefficient in the polynomial {z(1-+-z)}#». In each 
of the polynomials {z(1-+-z)}#»/p,,, the moduli of the coefficients do not 
exceed unity and one of them is actually equal to unity. On the other 
hand, the degree of the highest term of this polynomial is 2u,, and that 
of the lowest term of the next one is y,,,,. Hence, since 


Pav > Zu +2 ~ 2uns 


the individual powers of z in different terms of (16a) do not overlap, 
so that the developments of all terms of (16 a) written down successively 
form a power series P(z) with the radius of convergence 1 (since the 
moduli of the coefficients of P(z) do not exceed 1 and an infinite number 
are actually equal to 1). The series (16a) is the sequence S,,,, of partial 
sums of P(z) and therefore converges uniformly in the interior of the 
unit-circle. 

Noting that 2#»/(u,+1) < p, < 2», we see that the region of con- 
vergence of (16a) is |z(1+-z)| < 2: that is a ‘dumb-bell shaped’ region D 
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which encloses the circles |z| < 1 and |z+1]| < 1, and touches them at 
= —2 and z= 1. Thus P(z) exhibits overconvergence in D in the 
sense of Ostrowski. 
Now, as in the discussion of the function ¢(z), we may make 
d?n*? Ry (z) 


dain? 





<1, 


provided that VN > N,(y,,). So, if we choose p,,,, so that N = p,,, >, 


re d2Hnt2R (x 
0 < YPrnt(x) < te) <1, 


and thus (z) satisfies 
max |YHnat2)(x)| < 1. 


O0<2x<}r 
Therefore, if we take a = a = 0, b = }n as before, T' the segment 
0 <a < }r of the real axis, f°"(x){a|a—b|}-*" tends to zero as n tends 
to infinity through the sequence consisting of the integers »,-+1, and 
we have seen that the partial sums 4,,,,. (= ),,) also converge 
uniformly in D. Thus all the conditions of Theorem IV are satisfied 
(n,, being the complete sequence »,, identical with the sequence p,,+ 1). 


II. GONTCHAROFF SERIES 

5. Poritsky has extended his methods for the Lidstone series to the 
generalized case where a set of m points replaces the pair of points a, b. 
Using his formulae, we can prove generalizations of Theorems IIT, IV, V. 
It may, however, be of more interest to prove the corresponding theorems 
for the m-point Gontcharoff series, which has been discussed by Gont- 
charoff (3) and others. 

By repeated integration by parts of the multiple integral 





zZ zZ 2{n—1) 
Fait) = [ dz’ | dz”... [ L(Y) d&, 
ay ds din 
where @,,,, = @,, we obtain the series in question, viz. 
ice m a 
2, 2 Pingsi-al) for(i), (19) 
j=0 i= 
where 
Zz 2 2\k—-1) 
Piz)=1, P,(z)= [ de’ f de" " | dZ®) (k>1). 
ay ag ak 


In order to investigate its overconvergence properties I shall find 
expressions for the remainder 7, ,,(z) in the form of m single integrals. 
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To be precise, I shall find sequences of functions H, ,,(z,2; 01, @2,..., Oy) 


to satisfy the identity 


Zz 


fue jar.. i u(x) da =3 | Fnle, 23 ay, gy) dy )ee(a) dx, 


a 


where w(x) is an arbitrary analytic function, and thus to satisfy 


2{n—1) a oe 
jae far. is a 4 j* in-1 (2’ » ©; Ag, Ag,.--, Am, a 1)u (x a) da, 
7 (20) 
where, in both cases @,,,, = @,. Clearly 
A, (2, ©}, Ag+; Um) =], Hi, (2, £3 @,, Xg,..., By) =0 (¢>2). 
(21) 
Integrating (20) we find 
Zz 2’ 2fa—1) 
dz’ | dz’... u(x) dx 
va a 


n Zz 2 a 
“2 | ae’ | FH, ,-1(2', ©} Ag, Ag,.+-) A, 4) U(x) dar 
_— a a 


1 +1 


i=1 as 


m—1{ * eo 
= {f u(x) dx | Fi, n-1(2', ©; Ag, Bg,.++) Bq, @y) Az’ + 
x 


oe | ~@) de { Biya ©} Bg, Bg,..+) By, Ay) az’) + 


Gi+1 


, 


+ fu (a) de fH m,n-1(2 » U3 Bq, Ag,.--, By, 2) dz’. 


- m1 @ _ 
Hy (25%; Ay, Ag3++)%m) = > | 4, n—1(2' 5 5 Ag, Agy+++ bys Ay) dz’ +- 
zx 


+ | Fy, n-1(2 1B} Bq, Bes...) Bn, @,) Gz’ (22) 
x 


and 


Z 
F, 1,(2, ©; Ay, Ag,.-+, Am) — | i-1,n-1(2' ) 2} @g, Ag,.. +» Am; My) dz (¢ => 2) 


‘ (23) 








3) | 
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and it follows that 


OH, (2, X; Ay, Ag,..-, Um) a (€, 25 6g, qy.++, Mey My) 
- 2%» ’ geee m? 














a = m,n—-1 
. dz (24) 
OH, ,,(2, #3 Ay, Ag,...5 Dn =S 
i 22 , = Hy_4n-1(%, U5 Mg, Mg5-++;%my%) (1 D 2) 
and, repeating the process, 
ie —H,,_, (alli;n >m+1). 
em ' 
By induction, using (21), (22), (23), we have 
—__y\jn-1 
Lema II. > HT, ,,(2, 231, Og...) Bq) = ar 
Also, by (23), 
Hyp (2,234, 4q)---y%p)igna, = 0 (i #1) (25) 
and, by (24) and (25), 
0H, ,,( ©} dy, Oy Am) —0 (#2). 
Cz 2=a2 





Repeating the process, we have 























ok-1 , 
oH, pls 35 Sys Baron at Gm)! $9 (k=1,2,..,m; ik) (26) 
ozk- sina 
and hence, by (26) and Lemma II, 
ORLA, (2, ©; Ay, Bg,..-, Bp) _ (a,—x)"-* } 
aak-1 <a, (n—k)! 
(k = 1,2,....m; n >k) }. (27) 
ak-1HN, (2 xr: a..a 
a Fn(% 25 By» ayer» Gm) =0 (n<bk) 
dz tiie 
Now let G,(z,#3A) = > A"-1H, (2, £3 A, By,..+) Bm) (28) 
It follows from (24), (28) that ' 
anG, - ‘ 
at = AG (29) 
ak-1G, ; 
By (26) Tid =0 (k= 1,2,...,.m; + 4k) (30) 
od 
and, by (27), 
ak-1() 
OG) k-th) (k = 1,2,..., m). (31) 
azk-1 2=ak 
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Hence, solving the partial differential equation (29) with boundary 
conditions (30) and (31), we have 


a a Ven Qa 
where A,, B,,..., K; are functions of x satisfying 
C= A,A%+ B; wh-ledwart t+ K, yfk—IWXm—Dehw"— ray (k # 1), 
eXaj-) — A, eat B; wi-lehwat |. + K, we-Wim—Nedw™—*a;_ 


Hence, for example, 


eXai-2) edwa, : J . eAw"— lay 
1 0 werwas ‘ ; : wm-ledw™— tas 
A, = =— 
0 wm-leAwan ae wml ew™— lam 
where 

era eAwa, : ; q eAw™—tay 
~~ eras werwas ’ : . wm-ledw™ lay 
D(r) = 

eran wm-ledwan, te wml) dw” lam, 


Similar formulae may be written down for the remaining functions 
A,, B;,..., K;. Now it follows from (28) that 


yz 1 G;,(z, x;A) 
o-aj-. 
Co 


where C, is a circle |A| = r < f, and p is the modulus of the root of 
D(A) = 0 which is nearest the origin. Now consider the integral 


1 f Gi(z,x;A) 


27i J =e 
C1 
where C, is a circle |A| = R > , including only those roots of D(A) = 0 
whose modulus is equal to p. Then 


da, 








A, ==) 4 oR, (32) 
p 
where F;, is a function which is bounded if z and s are in any bounded 
region. Thus the expansions (28) are valid for |A| < #. 
Theorems analogous to Theorems III, IV, V may now be established. 


For example, the analogue of Theorem ITI is 





ry 
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THEOREM VI. Jf f(z) is analytic and satisfies 
lim p~” max | f™(z)| = 0 (33) 


n—> co eel 
uniformly on a rectifiable curve T on which ay, dg,..., Am lie, where p is the 
modulus of the root nearest the origin of D(A) = 0, then a sequence of partial 
sums of the Gontcharoff m-point series (19) converges uniformly to f(z), in 
the interior of the circle of convergence for f(z), with centre any point of I. 


For the remainder term 


z 2(m—1) 
Tm (2) rigs | a f de” ... | fO() de 
a ag an 
m = _ 
~ 2, Hi, (2, ©} Ay, Ag,.-5 Uy) f (8) ds 
i A 
ai 


may be written 


m 2 

T, (2) = Sf H,,, (2,8) f(s) ds+ | 34 A, (2, 8)f™s) ds, (34) 
and the argument proceeds as for Theorem III, using (32), (33), (34) and 
Lemma IJ. It may be noted also that sharpened forms of the classical 
theorems on the Gontcharoff m-point expansions [see (3)] are now avail- 
able by the same process as followed by Boas (1) and Gurin (4) for the 
Lidstone series and its generalization. 


6. The case m = 2 has been studied by J. M. Whittaker (8), I. J. 
Schoenberg (7), and others, and it may be of interest to note the special 
forms taken by some of the above formulae in this case. With a,,,,, =a, 
d,,, = b (all n) and writing K, A,,(z,2,a,b), B,(z,2,a,b) for 


m/2(a—b), H,,,(z,2,a,6), Hy,,(z,x,a,b) 


the remainder of the series after n terms is 


- 
Zz 


A,,(z, x, a,b) f(a) da+ [zB 3, (z, xv, a,b) f(x) da 


a b 
where 
=% 4( —])in 
A,,(z,x,a,b) = —_.— cos K(z—b)sin K(a—x)+0O sxit) (nm even), 
; ) = 
his net) = 8 ~~ ee 
A,,(2,%,a@,0) = — ~ant cos A (z—6)cos A (¢ amt) 


(n odd), 
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B,(z,2,a,b) = “Sar sin K(e—a)008 K(b—2)+0( so +) 
ee ak [3K] 
“ (n even), 
_ 4(—1)Kn+) 
B, (2, x, a,b) = Sanaa — Sin K( z—a)sin K(b— —2)+ O(a +4) 
(n odd). 


Also the generating functions (28) reduce to 
eXa—2) cosh A(z—b) 
cosh A(a—b) 
eXb-2) sinh A(z—a) 
cosh A(a—6) 


G,(z, x, A) — 








and G,(z, 2, A) = 
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AN ITERATIVE PROCESS FOR THE 
NUMERICAL DETERMINATION OF 
CHARACTERISTIC VALUES OF 
CERTAIN MATRICES 


By 8. N. AFRIAT (Ozford) 
[Received 8 February 1950] 


1, Introduction 
In general terms, when a diagonal element of a Hermitian matrix is 
sufficiently well separated from the remaining diagonal elements, by 
the sums of the moduli of non-diagonal elements in respective rows or 
columns, there is associated with it a characteristic value of the matrix 
which may be determined indefinitely closely by the iteration to be 
described. Thus, in particular, when the non-diagonal elements of the 
matrix are sufficiently small compared with the intervals between the 
diagonal elements, then all the characteristic values may be so deter- 
mined. 
2. The Process 

Let a be a Hermitian matrix of order n and let a condition R, on its 
elements be given by the inequalities 


R,: |a,,—@,,,,| m 2 2 ldyel+ > lux (wu Fv). 
KAY KF 


The x circles 
C,: |2—Ay| < > |ay_el (v= 1,..., 0), 
KFV 
which we may call the diagonal circles of a, each contain a characteristic 
value of a (1); and, if the condition R, holds, then the circle C, is 
disjoint from the rest and contains a single characteristic value, say 4,, 
of a in its interior. 

Let A,,, be the principal square submatrix of a obtained by omitting 
the vth row and column; then its n—1 diagonal circles Cw (u ~ v), 
containing the characteristic values of A,,, are each contained in a 
diagonal circle of a: thus C,,, is contained in C, (u ¥ »). 

For any A in C, and a characteristic value ’ of A,,, as a consequence 


of R, we have 
’ A—2'| > a,,|. 
AA] > & ye! 


Or, if w is a characteristic value of A,,—AI for A in C,, then A+ is a 
characteristic value of A,,, and |w| > > |a,,|. 
K#Y 


Quart. J. Math. Oxford (2), 2 (1951), 121-2 
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Write a(A)=a—A, A, (A) = A, Al; 


then taking the Cauchy expansion of the determinant |a(A)| with respect 
to the vth row and column, 


|a(A)| wes |A,,( A)|{a,,—A— a, A i A)a,y}, 
where a@,, a,) are the vth row and column ule” of a with their 
vth elements eee 
Now |A,,(A)| 4 0 for A in C,; and, since |a(A,)| = 0, we have 
é,, Aa, = a A,,*(A,)a a,), 
A;(A) denoting the reciprocal of A,,(A). 
Further, we have 


A; (A)a,) = a = A," (A)a,) 


a,{—At0)(5 

= a, A;,*(A)a,) 
Also, if w is the characteristic value of A,,(A) of smallest modulus, then 
w~® is the largest characteristic value and so is the bound of A;,?(A). 
Whence, for A in C,, 


0 < aA, A;,?(A)a,) <a - a,) 
< 


-2 
(3 laonl?)( 3 level) <2 
KFV 
by the Hermitian character of a, and Schwartz’s inequality. 
It follows, for the sequence A” (n = 1, 2,...) defined by 
ra = ay, a, — AMD = & ARAM )a,), 
that AM +X, (n>). 

Thus, whenever a condition R,, for some v, holds for a, a charac- 
teristic value A, of a is calculable to any desired accuracy by computing 
successive terms of the sequence A”. The initial term A? may in fact 
be anywhere in C,, and the error in the nth term is at most 


((& lavul?)/(_& ltoul)']” Ire 
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THE WORD PROBLEM IN A GROUP EXTENSION 
By W. H. COCKCROFT (Aberdeen) 
[Received 27 February 1950] 


1. Introduction 
Let EZ be a group extensionyt of a group Q by a group K: that is to 
say, there is an invariant subgroup K of £, and a homomorphism 
7: E> Q mapping Z onto Q with kernel K. Suppose that Q and K are 
given as abstract groups in terms of generators and relations. The main 
purpose of this note is then to give a set of generators and relations for 
E and also, under certain conditions, to give a finite construction for 
the solution of the word problem in £. 

An application of the result is given. Let X be a topological space 
with an Abelian fundamental group y which has a finite number of 


generators. Let X* = X4-824+...4 82, 
where H? (i = 1, 2,..., m) are oriented 2-cells bounded by oriented 


circuits in X and not meeting X or each other in any point internal to 
them. The second relative homotopy group 7,(X*, X) of this space is 
effectively calculated{ by the methods of the note. 


2. Outline of results 

Part I. I shall take EZ (and hence K) as an additive group, although 
it will not necessarily be commutative, and Q as a multiplicative group. 
Words in the generators of an additive group will be written as sums, 
with the symbol 0 for the empty word. In a multiplicative group a word 
will have its normal meaning in the context, with 1 representing the 
empty word. Elements of the abstract groups will be regarded as 
equivalence classes of words w in the (‘symbolic’) generators, and will 


be written w in Part I. I shall use the normal convention of w, = wy, 


for the equivalence of words and w, = W, for the equality of elements. 
I recall that, by choosing representative elements u(x) € £ for all 
xé€Q, with u(x) €7~1(x) and u(1) = 0, we may define ‘normalized’ 
factor-sets h(x, y) of Q in K, and automorphisms « of K, by 
h(x, y) = u(x)+u(y)—u(xy), = a(x)k = u(x)+k—u(x) 


Tt See (1) § 2 for the general definition of a group extension. Compare also 
the generators and relations defining a group extension in (2) § 1 (Theorem 1 in 
particular). 

t For a precise definition of effective calculation see (6) 1237, Note ITT. 


Quart. J. Math. Oxford (2), 2 (1951), 123-34 
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for all x, yc Q and ke K. We may also define a function f of the 
generators x; of Q, with values in EZ, by setting f(x) = u(xé), for all 
generators x; of Q@, e= +1. This definition may be extended by 
linearity to words in these generators and, if R, = 1 is any relation of Q, 
it is trivial that f(R,) € K. 

We assume that definite words are chosen in the classes «(x{)k; and 
f(R)), for all generators x; of Q and k; of K, and all relations R, = 1 of Q. 
These words, which are given in terms of generators k; of K, will be 
denoted by W(x{,k;) and W(R)), respectively. By means of them we 
construct in § 3 a symbolic extension E* of Q, with associated homo- 
morphisms 7,: H* > Q and A,: K = 7;1(1). This extension Z*, which is 
defined in terms of generators and relations, is proved in § 4 to be 
equivalent} to Z, in the sense that there is an isomorphism 9: E* > E 
(onto Z) with 79 = 7,, 6A, = A (A being the identity isomorphism of K 
into £). 

When XK is Abelian, Z may be defined{ in terms of an operator 
homomorphism, into K, of a free group F, which contains a normal 
subgroup R such that F/R ~ Q. In §5 1 show that in this case the 
function f can be determined by this operator homomorphism. 

In § 6 a solution of the word problem in £’ is given if it is solved in K 
and (constructively) in Q. By a constructive solution of the word prob- 
lem in a (multiplicative) group is meant a rule for expressing any word 
which is equivalent to the empty word as a word of the form 


iy. WwW, 


where W is any word in the group, R = 1 is any relation, and « = +1. 


Part II, The groups Q and K may arise as abstract groups, as above, 
when Z£ itself is initially given in terms of generators and relations. 
Thus a representation of Q may then be obtained by adding relations 
to those of H#, and generators and relations of the kernel K of the 
homomorphism 7: EH - Q may be calculated by the methods of Schreier 
in (3). The symbolic extension H* then gives a new representation for £, 
and, if the word problems can be solved as required in Q and K, we can 
effectively calculate £. 

In Part II below we consider from this point of view the free crossed 
(y,@)-module p which is an abstract group isomorphic to the group 
m(X*, X) defined in §1. We shall use the definition of a free crossed 


+ Compare the definition of (Q, K) equivalence in (1) § 11. 
t See (5), § 9, Theorem A’. 
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module given in (4) [§ 2, 455]. The notation used will be that of (4) and 
the convention of using symbols w to denote elements of an abstract 
group, in order to distinguish them from words w in the generators, will 
be dropped. 

The group p is an extension of p made Abelian by the commutator 
subgroup of p. Thus effectively to calculate 7,(X*, X) 

(a) we give generators and relations for p made Abelian and effec- 

tively solve the word problem in it; 

(b) we give generators and.relations for the commutator subgroup 

of p, and solve the word problem in it; 

(c) we construct the symbolic extension p* which is equivalent to p 

and solve the word problem in it by the methods of § 6. 
Finally, as p is a group with operators in y, we calculate the operation 
of y on p*. 
Part I. THE SyMBoiic EXTENSION 
3. Definition of E* 

Let the generators of the group Q be the symbols (+-,2;) with their 
‘inverses’ (—,x;,), written 2;, x71 (i = 1, 2,...) and let its relations be 
Ry(x;) = 1(A = 1, 2,...). Let the generators of K be the symbols (+, k;) 
with their ‘negatives’ (—,k;), written k;, —k; (j = 1, 2,...), or more 
briefly ek; (e = +1). Let its relations be S,,(k;) = 0 (u = 1, 2,...). 

The symbolic extension £* is then an additive group generated by 
symbols (+, X;), (+, K;), (—, X;), (—, Kj), written X;, K;, —X;, —K;, 
where the aggregates {X,} and {K;} are in one-to-one correspondence 
with the aggregates {a,;} and {k;} respectively. To obtain the relations 
of Z* we define a map o of the generators of K into those of £°, 

o(ek;) = eK; (€ = +1), 
which is extended by linearity to words in the generators. Let the 
images, under the map o, of the words W(R,) and W(x§,k;) be f, and 
af; respectively. The relations of Z* are then 


R\(X,;) =f, (for all A), (1) 
S,(K;)=0 (for all p), (2) 
eX,+K;—eX,; = of; (for all i, j; « = +1), (3) 


where R)(X;) is written additively. 

Since S,,(A;), for all u, are among the relations of H*, the map o deter- 
mines a homomorphism A,: K -> E* (into Z*). Similarly a homomorphism 
t,: E* > Q (onto Q) such that 751(1) = A, K, is defined by 

T.(eX;) = Xf, 7,(eK;) = 1 (¢= +I). 
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4. The equivalence of E* and £ 

The extension H may be constructed} as a group of pairs (k, x) for 
all k e K and x € Q, with addition defined by 

(k, x)+(m, y) = (k+o(x)m-+ h(x, y), xy), 
and zero element (0,1). The correspondence k <~ (k, 1), for all k € K, 
defines an isomorphism A: K > E (into Z). Similarly the homomorphism 
7: E > Q (onto Q) with kernel AK is defined by 7(k, x) = x. The auto- 
morphisms of AK are defined by taking u(x) = (0, x), for all x € Q; thus 
a(x)(k, 1) = (0, x)+(k, 1)—(0, x). 
Define a map @ of the generators of H* to elements of EZ: 
6(eX;) = (0, x§); 6(eK;) = €(k;,1) (for all 7, 7; « = +1). 
Let @ be extended by linearity to words in the generators of H*. It may 
be verified that @ is consistent with the relations of #*. For instance, 
suppose that R,(X;) = 0 is given by 
€,X, +6,X,.4+...t¢e,X, =O (e,= +1). 
Then we have 
6( Ry(X,)) = (0, xf) + (0, xf)+...+(0, xf) 
= (u(xf)+ u(xf)+...4+ u(xf*), xf xf...xfr) 
(for h(x, y) = u(x)+u(y)—u(xy)) 
= (f(R), Ry(x,)) 
== Of,. 

Thus 6 determines a homomorphism 6: H* > E (into Z). By definition 
70 = 7,, 04, = A and hence 9: E* = E, and A;>1(0) = 0 [cf. (7) 435, 
Lemma 3]. Thus £' is an extension of Q by K and 0 is an equivalence. 
5. The symbolic extension when K is Abelian 

Let K be Abelian and let Q be regardedt as a factor group F/R of 
a free group F, which is freely generated by the symbols {z;}. By 
Theorem A’ of (5) the extension ZH may be defined by means of an 
operator homomorphism of R into K. Indeed, given the factor sets 
h(x, y), this homomorphism is defined as g|R where g is determined, 
inductively, by the equations 


g(x x;) = g(x)+h(x, x,), 
g(x x1) = g(x)—h(x x;1,x,), 


g(1) = 9, 
Tt See (1) § 8. 
t Compare Kurt Reidemeister, Linfiihrung in die Kombinatorische Topologie, 
Ch. 2, § 5. 

















THE WORD PROBLEM IN A GROUP EXTENSION 127 
where x is any reduced element} of F. Using the definition 
h(x, y) = u(x)+u(y)—u(xy), 
it is easily verified that g|R = f|R, where f is the homomorphism of F 
into E defined by f(x;) = eu(x,). But, as is readily verified, the function 
f of § 2 may be defined in this way when K is Abelian, in which case 
§: H* > E is altered so that 0(eX;) = (0, 2;) and the proof of § 4 remains 
valid. 
Thus, when K is Abelian, the words W(R)) of § 2 may be defined as 
representative words of the classes g(R)), where g: R > K is the appro- 
priate operator homomorphism associated with £. 


6. The word problem in E* 
I define two operations on words in E°. 
Operation (i) Given a word W in E of the form 
U+eX;+K;+V («= +1), 
take the generator «X ; to the right of the generator K;, by using relations 
(3) of #: thus W = Utos+eX,+0; 

Operation (ii) Suppose that by the reduction processes of Q a word 
in the generators x; of Q may be reduced to the empty word and that 
part of the reduction process is 

WRWAeWW (ce = +1). 
Then given the same word in the generators X; of H* we have, by 
relations (1) and (3) of £°, 
W+eR)+W, = Wt+efpt+W, = U+W4+W, 
where U is a word in the generators K; obtained by taking the generators 
in W, to the right of those in f, by operations of the type (i). Thus a 
reduction in Q may be ‘copied’ in E£*. 

Given any word in the generators of H*, say W, we transform it by 
operations of the type (i), in a finite number of steps, to the form W,+- WW, 
where W, is a word in the generators K;, and W, is a word in the genera- 
tors X;. If the word W, in the generators x; of Q is not reducible to the 
empty word, then neither is W, in the generators X; of H*%, for 7, is a 
homomorphism. Otherwise, we reduce W, in Q and ‘copy’ the reduction 
in E* by operations of the type (ii). W is thus reduced to a word W,+U 
in the generators K;. By the solution of the word problem in K we 
know whether such a word is reducible to the empty word by use of 
relations (2) of H*. Thus we have a solution of the word problem in E’. 


t i.e. an element in which no formal cancellation of generators and their 
inverses is possible. 
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Part II. Tot ErrectivE CALCULATION OF 7,(X*, X) 


7. Definition of the free crossed (y,d)-module p 
Let y be an Abelian group with a finite number of generators. Let 
{a;} (¢ = 1, 2,..., m) be a finite aggregate of symbols with each of which 
is associated an element 2x; of y. The free crossed (y,d)-module p is 
then an additive group whose generators are written (x, «;), —(x, «;), for 
all elements x € y and all 7. Its relations are 
(x, a )+(y, aj) —(2, o;) = (waxy, o;) = (2%; Y, o;) 
for all pairs of elements 2, y c y and all values of 7, 7, together with the 
‘trivial’ relations. 
It is proved in § 2 of (4) that such a group satisfies the properties of 
a crossed (y,d)-module, i.e. 
(i) p admits y as a group of operators; 
(ii) there is a homomorphism d: p > y such that 
d(xa) = x(da)x-1 (AE p, Ey); 
(iii) a+b—a = (da)b, for every pair a,b C p. 
I say that, in consequence of the fact that y is an Abelian group, 
p has the following further properties: 
(iv) the commutator subgroup |p, p| of p is in the centre of p; 
(v) the commutator [a, b| is bilinear in a and b, and 
[—a, 0] = [a,b] = —[a,0] 
for all pairs a,b c p; 
(vi) [xva, yb] = [a,b] for all a,bc p, and all x,ycy. 
It follows from (iii) that the kernel d-1(1) of the homomorphism 
d: py is in the centre of p. But 
d{a,b] = (da)(db)(da)-1(db)-1 = 1. 
Hence [p, p] c d-1(1) and (iv) is proved. We also have 
[a,b+c] = a+b+c—a—c—b 
= (da)b+[a,c]—6, by (iii), 
= (da)b—b+[a,c], by (iv), 
=[a,b]+{a,c], by (iti), 
and [a+b,c] = a+b+c—b—a—c 
= a-+[b,c]—(dc)a, by (iii), 
a—(dc)a+[b,c], by (iv), 
= [a,c]+[}, c], by (iii). 
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Thus the commutators are bilinear. Also 


[—a,b]+[a,6] = [0,6], by (v), 
= 0, 
and hence [—a,b] = —[a, b]. 
Similarly [a, —b] = —[a, b]. 


Finally we have 
[xa, yb] = xa+yb—xa—yb 
= {d(xa)}yb—yb, __ by (iii), 


= (da)yb—yb, by (ii), because y is 
Abelian, 

= a-+(yb)—a—yb, by (iii), 

= a—{d(yb)}a, by (iii), 

= a—(db)a, by (ii), 

= [a,b], by (iii). 


Thus (vi) is proved. 

It is obvious that p is an extension of p made Abelian by the com- 
mutator subgroup [p, p]. By (iv), p is a central extension. 

The element of p corresponding to the generator (1, «;) is denoted by 
a;, and da; = x;. Thus any element of p is of the form 


€1Y1 Ui, Hen Yo, +. FEn Yn Vins 
where e«; = +1, y; € y and repetitions are allowed. 


8. The group p made Abelian 

Let h: p > p/[p, p| = C be the homomorphism which makes p Abelian. 
Let 7 denote the factor group y/dp. The group C is a free 7-module 
with basis {ha;} [cf. (4) § 2, 457], ie. if ha; = a,;, C is a free Abelian 
group with generators {7d,} for all co-sets  € y and i = 1, 2...., n. 

The group y, which is obtained by adding relations to those of p, is, 
like y, an Abelian group with a finite number of generators. Hence the 
word problem may be solved in 7 and we may calculate the elements 
tev. Thus the generators {%a,} of C are calculated. 

The word problem may obviously be solved in C. Furthermore, it 
may be solved effectively, for any word equivalent to the empty word 
is easily expressible as a sum of commutators of pairs of the generators, 
together with the transforms of such commutators. 


3695 .2.2 K 
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9. The commutator subgroup of p 
It follows from the properties (iv), (v), (vi) of p, that the commutator 


subgroup [p, p| is the Abelian subgroup of p which is generated by the 
commutators [a,,a;]. I shall thus denote the generators of [p,p] by 


+¢;; (,9 = LP : Ee nN; a F J). 
It is obvious that the subgroup dp of y has generating elements {x,} 
(t = 1, 2,..., n) because y is Abelian. Let 


S = a'2)...22 = 1 
be any relation.among these generating elements. The group [p, p] is 
an Abelian group with generators {c,;} as above, and relations 
Cy te =90 (for all c;;), 
n 
> 76; =9 (for all relations S = 1 of dp; j = 1,..., n; i # J), 
i=1 


together with the trivial relations. 
The first relations obviously hold in [p, p]. The second relations also 


hold, for P P 
> r{a;,a;] = [> ra, 45), by (v), 
i=1 i=1 
ez & 
because >) r,a,€d-1(1), 
i=1 


which is in the centre of p. We require to prove that these relations 
form a complete set of relations for [p, p]. 

For convenience I shall no longer distinguish between generating 
elements and symbolic generators of a group. Thus p has generators 
{xa,} for all xe y, i = 1, 2,..., m. Add to p the generators {c,;} and the 
relations 

Cy =a,+a;—a,—a; (t,j7 = 1, 2,....03 4 AJ). 
It follows that [p, p] is now the subgroup of p generated by the subset 
{c,;} of the generators of p. We thus apply Schreier’s theorem [(3) 174, 
§ 3] to prove the completeness of the relations. 

First we define a representative system {u(c)}, u(c) € h-1(c), u(0) = 0, 
for alle e C. By collecting together the generators of C associated with 
the same basis element and carrying out all possible cancellations, every 
element c of C may be expressed in a unique form 

e= (¥ yna,)+..+(F rama) (= £1, Hep), 
j= j=1 
where i; <i, <... <4, and the elements 7, for each k, are in some 
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preassigned order given by an ordering of the elements of 7. Thus, 
taking representatives v(7) in each co-set 9 of y, with v(1) = 1, we define 


wo) = (SPP ag) ++ ( S o1ayha,). 


The Schreier generators of |p, p] are then 
UO, 2a, = U(c)+2a,—u(c+2a,), 
Uney = Uc) +e;—u(c), 
for all c € C and all generators xa,, c;; of p. 
Secondly we define a fixed representative word U*, in the generators 
¢,;, for each Schreier generator U. Associated with the element c € C, 
defined as above, are the integers 


Pk 
L,, = > ® (k = Bi ocs m). 


j=1 
It is easily verified that 
U, 2a, = wa; i—U(& f)a; —l, Cig Laas Cy... — by, Crim? 


where i,_, <i <%,. Define a fixed representative word, in the genera- 
tors x; of dp, for every element 2—10(X) of dp. Suppose that x-!v(#) is 
represented by 

rixy...c8 = d(8,a,+8,4,+...+8, An). 


Then 
| an = [8,a,+.. +S An, CA; i]— z Ly Cit, 
n ™m 
= J Hey t D lela 
j=1 k=q 
Define  — 


n m™m 
T* — 
Ob cu = > 8+ Dd eis 
j=1 k=q 


for all generators xa,;, c;; of p, and all c € C defined as above. 

Any element a €[p, p| can be expressed as a sum of Schreier genera- 
tors, and on substituting U* for each generator U, we obtain a word 
(a)*, say, in the generators c;;. By Schreier’s theorem a complete set 
of relations for [p, p| is given by 


(u(c)+ R—u(c))* = 0 


for all c € C, and all relations R = 0 of p. 
Since |p, p] is Abelian, these relations will imply 


Cis +Cpp—Ciz—Crg = O, 
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for all c;;, ¢,,. In what follows I shall assume these commuting relations 


to be given. 
then 


(u(c)+ R—u(c))* = US .yt+ Uaj+ Ut +a,ai— Ue +a,aj— Ueav 

which is easily verified to be equivalent to 
Cytey, (¢ > J); 

or Cy—Cyy (<j), 

by using the commuting relations of [p, p]. 

Again, using the commuting relations of [p, p], if R = 0 is 
LA; + YA;—La,— x, ya, = 0, 
we have 
(u(c)+ R—u(c))* = Ue 2a + Ut ta,vay— 0 +94),20i— Oe rway 


which is easily verified to be equivalent to 
n n 
C Chs— Chi, 
ut 2 Pe ki PT ki 


where y~1v(g) is represented by xf'...7?* and x; 1y-1v() is represented by 
xt...2%, But these representatives imply that 


P21, 2 Pt—ae+l, aPa—dn =] 


is a relation of dp. Moreover, 
(w(c) + R—u(c))* = (py— yey +. + (Pi— Get Cig +++ + (Par— Ue) Cay: 
Thus the relations of [p, p] given above are complete. 


The word problem may obviously be solved in [p,p], for it is an 
Abelian group with a finite number of generators. 


10. The calculation of the equivalent extension 

Having calculated C and [p, p], we can calculate the symbolic exten- 
sion p* of C by [p,p], which is equivalent to p. Since [p, p| is Abelian, 
the extension may be defined in terms of an operator homomorphism. 
Let F be a free group with generators {za,} (Gey; i= 1, 2,..., n), 
and let C ~ F/R. The operator homomorphism f: R > [p, p| which is 
associated with the extension is defined by f = g|R, where g: F > pis 
given by g(%a,;) = eu(@a;) (e = +1) [ef. § 5 above]. 

Since [p, p] is in the centre of p, the inner automorphisms of p leave 
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elements of [p, p] unaltered. Thus, following the definition of Z', p* is 
given by 

Generators: C,;, XA,, for all c;;, #4,, respectively. 
aa a aie Oe C.. } j 
Relations: XA,+¥YA4,—X4,—YA, = yw WFD 
.. ez 
C;+C;,=9 (for all C;;), 
> r,C,;=0 (forall relations S = 1 of dp; 7 = 1.,,...,n; + # J), (2) 
i=1 


C;+C,,—C,;—C,, = 9 (for all C;;, C,,), 

XA,+0,,—XA, = (for all C;,;,, XA;), (3) 
together with the trivial relations. 

The word problem can now be solved in p* by the methods of § 6. 


ij 2 


11. The operation of y on p* 
To define the operation of y on p*, we explicitly define the equivalence 
¢: p = p®, and then define the operation by 


wa = $(ap-(a)) (wey, ae p’). 

The extension p may be defined as a group of pairs (k,c), for all 
ke|[p,p| and ce C, with the appropriate addition formula, just as Z 
was so defined in § 4 above. In fact p may be identifiedt with this group 
by an equivalence 7 defined by 


i(ku(c)) = (k,c) (ke[p,p], ce C), 
where u(c) € h-1(c) is a chosen representative of the element ce C. If 6 


is the equivalence, defined as in § 4 above, between the extension ip and 
the symbolic extension p*, then we define ¢ = i: p = p’. 


12. The kernelt of d: p> y 

The kernel of the homomorphism d: py can be calculated by 
Schreier’s methods, and p can then be constructed as an extension of 
dp by d-1(1). This method is more complicated than the one given 
above which uses |p, p] and C. However, we may now calculate d-1(1) 
directly. 

Let the relations among the generating elements {x,} of dp be reduced 


to an independent set 


or) me) 7 = 
2 te 0° =1l (fj = l,..., p), 


the rows of the matrix [rY] (¢ = 1,..., n; j = L...., p) are linearly 


Tt See (1) § 8, 334. 
t d-(1) = 2,(X*) if 7,(X) = 0. 
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independent over the ring of integers. Associate with each such relation 
a symbol 6;. 
Using the chosen representatives v(#) of the co-sets % of 7, associate 
with each element v(%)a,;—a,; ( 4 1) a symbol [Z, a,]. 
Let A be the free additive Abelian group generated by the symbols 5; 
and [Z,a,], together with their ‘negatives’. 
The kernel of d: p > y is isomorphic to the direct sum A+[p, p]. 
Define ¢: A+[p, p] > p by 
$(3;) = Pa... +a, 
$[Z, a,] = v(#)a,—a,, 
$(C4j) = (4%, a5]. 
It is easily verified that ¢ is a homomorphism into d-1(1). It. is an 
isomorphism, for obviously ¢ ||», p] is an isomorphism, and, if 


$( dM d:t D red z, a,]) me 
v Zt 
where p;, vz; are integers, then making p Abelian, we have 
n << ies es 
> DL MepPast > veskd,— ¥ ve,4; = 0. 
tj=1 Zi £,t 
Hence vz; = 0 and > v4? = 0(j = 1, 2,..., n). But the rows of the 
v 
matrix [r{”] are independent. Therefore »; = 0, for all i. 
The isomorphism ¢ is onto d-1(1), for any element of d-1(1), by a 
simple rearrangement of terms (using the fact that d-1(1) is in the centre 


of p), can be expressed as a sum of the images under ¢ of the generators 
of A and [p, p]. 


I should like to thank Professor J. H. C. Whitehead for his helpful 
criticism of both exposition and content of this note. 
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INEQUALITIES WITH NON-CONJUGATE 
PARAMETERS 


By F. F. BONSALL (Newcastle-upon-Tyne) 


[Received 30 March 1950] 


1, Let 
p>1, qg>1, I/pt+i/p’=1, Igt+l/q =1, Iptl/q>1, 
so that 0<A= 2—1/p—1/q = 1/p’+ 1/7 <1. 


Let functions f, g, etc. be non-negative. The parameters p, q are 
conjugate if 1/p+-1/q¢ = 1 (and consequently A = 1) and otherwise are 
non-conjugate. 

The essence of the present paper is the apparently trivial observation 
that, since 1/q¢’+-1—A = 1/p, we have 


[ Afg dex = [ (uf) (vgnye” (uf 292) der, (1.1) 
where wu, v, w are at our disposal subject only to the condition 
Ula ylP'ywi-A — , 
Now, since 1/¢’+1/p’+1—A = 1, Hélder’s inequality may be applied 
to (1.1) and yields 


| kfg dx < ({ uf? dex)" ( | vgt dex)'” ( | wf Pg? dz). (1.2) 


In spite of its trivial appearance, (1.2) is useful for investigating 
inequalities involving non-conjugate parameters. Results correspond- 
ing to (1.2) clearly hold also for multiple integrals and for series. 

The technique outlined above is applied in the first place to the 
extensions and analogues of Hilbert’s double series theorem due to 
Hardy, Littlewood, and Pélya (1), Levin (2), (3) and I. Schur (4). In 
all cases it yields more elegant proofs and in some cases new or sharper 
results; § 6 contains some remarks on the definition of equi-measurable 
rearrangements of functions. 

Another typical application is a proof of Jensen’s extension to non- 
conjugate parameters of Hélder’s inequality for series 

¥ a,b, < (> azyr( > 62)". (1.3) 
In this case a slightly stronger inequality than (1.3) is obtained, and 


some light is thrown on the domain of validity of the corresponding 
inequality for integrals. 


Quart. J. Math. Oxford (2), 2 (1951), 135-50 
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In § 11 the technique is generalized to inequalities with n parameters 
and as an example an extension of Hilbert’s theorem with three non- 
conjugate parameters is obtained. 

The remaining sections, § 12 and § 13, are rather loosely connected 
with the rest. § 12 is concerned with the ‘sharpest’ form of Hilbert’s 
double series theorem. It is shown that, if the theorem for series is 
deduced in a perfectly straightforward way from the corresponding 
theorem for integrals, we obtain 


oO 


1 
> | mits 3(m-n—1)** 3. ce yr | 








mn=1 
= Up [<= 1/p’ 
< 7 cosec— =( 4 ( ox) ‘ 

2%) \2 
A part of this result is due to Levin (3), who, however, obtained it in 
a much less direct way. A similarly sharpened form of the theorem with 
non-conjugate parameters is also given. § 13 contains a proof of the 
known result that 7 cosec(7/p) is the best possible constant in Hilbert’s 
theorem when the parameters are conjugate. 


2. Let f(x) € L°(0,00) and g(y) € £%(0,00) and write 
[fe@)dx=F, [gly dy= 4. (2.1) 
0 0 
It was proved by Hardy, Littlewood, and Pélya (1) that 


| (= (7)9( fe ndedy < < K(p,q)F¥Qua, (2.2) 
0 0 


Their proof, however, threw little light on the value of K(p,q). This 
situation was improved by Levin (2) who proved (2.2) by a method 
involving only repeated application of Hdlder’s inequality and obtained 
the result 








K(p,9) < (reosee,) (2.3) 


This result, although probably not best-possible, is interesting since it 
reduces to the known best-possible value when p, ¢ are conjugate. The 
technique of the present paper enables us to obtain this result by a 
single application of Hélder’s inequality. 











Lo al 
‘ 


SO DB Tl ft 


ad 
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THEOREM 1. 
| | bie dady < (cose) Fureqgua,+ (2.4) 
0 0 
Proof. 
r=( | —% aa 
J ir +ypon" 





fen e (t yg ay 


< Te Tye 1. 
Here 
eo 


| f7@)gly) dady = PG, 


=f fool 
i 
0 


I, = | 





r r —t/p’ 
f?(x) dx | ae on setting y = xu, 
U 
0 


= mcosec(zt/p’) F, 
I, = mcosec(nt/q’)G. 


Combining these results we get 
Iq’ 1/p’ 
I< (- cose] (= cose") Furqua, (2.5) 
Pp 


Now ¢ is at our disposal, and it is a simple matter to show that the least 
value of the constant here is obtained by taking ¢ = 1/A, when (2.5) 
becomes (2.4). It is interesting that the smallest constant obtained in 
this way coincides with that obtained by Levin, but there is still no 
evidence that it is best-possible. 


3. In a recent paper (5) Stetkin showed that the series analogue of 
Theorem 1 is a consequence of the following general lemma: 


{ All such theorems contain implicitly the statement: if f « ZL? and g e L*, then 
the integral on the left exists and satisfies the inequality. 
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LemMA. Let > > din%mYn Ad YY Opn %mYn be positive bilinear 
forms with respective bounds K, and Ky in [1/o4,1/B;] and [1/ag, 1/B,] 
(O<a,<1;0<8; <1). Leta=koytkyay, B = k,B,+kyB. where 
0<k; <1,k+hk, = 1. Then > > aks, bk, 2m Yn ts bounded in [1/c, 1/B] 
with bound KK, 

Stetékin does not consider whether K,, K, being exact bounds implies 

that KK is exact. That this is in general untrue may be seen by 
considering a simple example. 


Let a, = 1, Ann = 0 otherwise; 
by, = 5=1, Onn = 0 otherwise. 
Then K, = 1, K, = 2!-*:, so that K#:Kk* > K where K = 1is the bound 
of FY Ahin Orin Xm Yn 
4. THrorEM 2. Let K(x, y) be homogeneous of degree —1 and such that 


} u-W' K(1,u) du = k, 


then J } f(a)g(y)K\a, y) dady < FU qua, (4.1) 


This is the generalization to the non-conjugate case of a theorem due 
to I. Schur (4) in the case p = q = 2 and extended to the general 
conjugate case by Hardy, Littlewood, and Pélya [(1) 229]. 


Proof. As in the proof of Theorem 1, 


_ | 9K dady < HT 1, 
00 


where 


I, = FG, 
oOo ow tip’ 
Fiat K(x, y) dady 
0 0 
= J | uP’ K(1,u) du, 
0 
L=G | vi’ K(v, 1) dv. 
Thus t 


re) lig’ ( @ Lp’ 
I< { fue, w) au} { f owKe, 1) av} FirGia, (4.2) 
0 


0 








o5«e 


—_—— 





ut 
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The parameter ¢ is at our disposal; and setting ¢ = 1/A and observing 
that © @ 

[ u-Wr'K(1,u) du = | v-V' K(v, 1) dv = k, 

0 0 
we obtain (4.1) from (4.2). This choice of ¢ is the best possible; for, by 
Hoélder’s inequality, 


~y /Aaq’ 
uP’ K (1, u) au} {f 


L/Ap’ 
vi K(v, 1) ae| 
0 0 
~ 1/Aq’{ @ 1/Ap’ 
= | w-»’K(1, w) au} {f u’-1K (1, w) au} 
0 0 


x 


> | {u-’K(1, u)}V full’ AK (1, aw)?’ du = k. 








0 
5. THEOREM 3. 
FF fla)gly) (2, 1) [ secewe ye snag)! 
dady < B’|—,—) F'VqGir’ P(x y dad , 
j [eee ” ng PY ey yoy 
00 0 0 
The constant BA = i u i nue is best-possible. 
( T(1/p’+1/9’) 
This result also i: (2.2), since 
aya’ _ (1/p’”'(1/q’P 
e via! 2 2s 0 2 1 
(@typ S yy ik 
In)’ \Alp’ ’\1/q’\1-A 
but the constant (" P’) a ) “| BA 


obtained in this way is not in general smaller than that in Theorem 1. 
Proof. 


= f [P20 aay 


x yy 
0 0 


[EQ eal 


0 0 
gt a\-l/p 1 < 1/p’ . a 
7 lo iG) rT aioe: ~~ 


. 








<— J1lq' J1/p’J1-A 
< [Va Tye", 
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Hi oO wo 
= I { [r% only) dandy 
= x ’ 
J J (x+y) 
~ r f f(x) y\-“4 dady 
wit J gi () (a+y) 
a du : 3 
= dx ee 
= fm () =| Gas? BS 7) 
has ox (1, : *): 
q 


It remains to prove that B’ is the best possible constant in this theorem. 
For this purpose, with 0 < « < 1, set 

f(x) = x-¥P in (e,1/e), f(x) = 0 elsewhere, 

g(y) = y-™@ in (e,1/e),  g(y) = 0 elsewhere. 
Then —— 


ates: eg 
one J wry * 
PA : 
7 1+y/x) x 
de | * yll’=1 
-{% (I+uy 
e/x 


= 2og(9B— R,— R3. 


e/x 


e-1 e/x 
sa dx yal ws — 
Role) aa <{# PEE ey ae 
€ 0 


e-! ro) 
dx uvg-1 e 
i= i >. - -Ip’-1 Jy — 7'2(1—e2P’) — 
R= [5 [eo we(t fu P-! du pr e?P’) O(1) 
€ 1/ex € l/ex 
Thus I = 2log(1/e)B+ O(1), 


F= fre ) dz = ‘é = 2log(1/e), G = 2log(1/e), 


ey dxdy = 21og(1/e)B-+0(1). 
a-|[% ety NY = Ploaltle 
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It follows that 
BAF Ge’ Ti = 2log(e!)B+0 flog(e-)}, 
and the theorem is now completed by letting « > 0. 


6. In the following sections, use is made of the equi-measurable 
rearrangement of a function in decreasing order. Let ¢(x) be a measur- 
able non-negative function of L(0, 00); then the measure M(y) of the set 
in which ¢(x) > yis finite for y > 0 and decreasing. Hardy, Littlewood, 
and Polya [(1) 276] define the decreasing rearrangement ¢(x) of the 
function ¢(x) by the equation 

F{M(y)} = y 
and remark that this definition is not quite complete. In fact, if d(x) is 


a step function, 4(x) is not defined anywhere by this equation. This 
difficulty may be avoided by making the definition 


d(x) = inf y. (6.1) 


M(y)<x 
If then M(c) denotes the measure of the set in which ¢(x) > c, it can 
be proved by well-known methods that 
M(c) = M(c). 


Consequently the measure of the set in which c, < ¢(x) < c, is the same 
as the measure of the set in which c, < ¢(x) < ¢,. 


7. It was proved by Levin (2) that, if f, g are the equi-measurable 
rearrangements of /, g in decreasing order, then 





© rf 20 4 eal < | fz ea day. 7.1 
Setting = 7 af P(x), y= —. yg(y), 


Levin deduced that 
; 'f (x)9(Y) 4 dy < Iedst-AiAv',yl-A/Aa’ Pua’ GAv’ 7.2 
(x+y) way S ¢ p ; ( . ) 


and that B(1/p’, 1/9’) is the best possible value of k. 


Writing o = $ Bp = . (7.3) 


F’ 
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we see that (7.2) is equivalent to 


r r flea) 4 is B( 1 *) (1-AyAp’gt1-A)Aa’ Fp Gft/a, 7.4 
| [erent < va “_ 


Now although the constant B in (7.4) is exact, it is nevertheless possible 
to assert a sharper inequality of this type. 


THEOREM 4. Let y be the smaller of «, B defined by (7.3). Then 


1 F fle (x)g(y) decd Bl 7) 1-A Pupgia 
J (epy oY SP peg)” 
and the constant B(1/p’, 1/q’) is best-possible. 


This inequality is evidently sharper than (7.4) except in the special 
case when « = f; the constant B is thus best-possible. It is an interest- 
ing feature of this inequality that the term y depends on only one of the 
functions f, g. 


Proof. Suppose, for example, that y = a < f; then 





roe 1p’ gy1/ -A 
< B pugu| | | Pee ae iy) , by Theorem 3, 
0 0 


° [p’—1 1 da 1-A 
sia aad g2(y) dy [af?(a)] ns ‘ rire | “| 
} J () (1+a/y)* y 


A,1-A Fp 1 /p’ wet -_ 
< Bal Fug | f a) tf ea 
0 


= Bal-A Fugu, 


8. The method of Theorems 3 and 4 can be applied to the double 
integral bet Cad 
| Fee dey (0<A<1). (8.1) 


—-O —-@ 





aK aa ot a 


al 


1e 
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Hardy, Littlewood, and Pélya [(1), Chapter X] prove that the bounded- 
ness of this bilinear form is a consequence of the boundedness of 


lees dady, (8.2) 


where f*, g* are restricted to be non-increasing functions. This latter 
result is then obtained in a somewhat roundabout manner. The follow- 
ing theorem establishes the boundedness of (8.2) for all f* and all non- 
increasing g*. It also gives an estimate for the bound. 


THEoREM 5. If g(y) is a non-increasing function and 0 <A <1, then 





[ [f f(x)g st dudy < K Fu, (8.3) 
where K< pra»? St (8.4) 


B — sup ¥9(y) < i, 


Since G S 


the theorem implies that 





, T(1/p’) , T(1/q’ 
K< ra—a{ rai at = M, say. (8.5) 
a f(x)g(y) 
Proof. i= dady < [Va [¥p' J1-A, 
ih a—y|r 


» 2 1 
= J [ w¥a'-1(1—w)-* du + [ v¥e’-1(1—v)-> ae] 


af) 0 


és [5 (=, 1-2) +B(Z,,1-a) 
eg p 





= FM. 
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Similarly L=GmM 
= 
a Up’ ylld’ 
Lk -{{ p oe ded 
r Up'ylla’-1 
< a| P(x da | * y d 


ad ad a 
— BFGM. 


Combining these results, we obtain (8.3) and (8.4). 


9. Hélder’s inequality for series was generalized by Jensen [(1) 29] 
in the following manner: 
If x, B,..., A are positive and «a+B+...+A > 1, then 


¥ a8... < (¥ a)%(> b)F..( IP. (9.1) 
This includes as a special case 
> ab < (X ar)n(d ba), (9.2) 


where p > 1,g > 1 and p-!+q-! > 1. The inequality (9.2) and indeed 
sharper inequalities of the same type can be proved by the methods of 
the present paper. 


THEOREM 6. Let p >1,q>1, p-+q1>1. Let the sequences (4,), 
(0,,) be the rearrangements of (a,,), (b,,) in decreasing order, 








AP ba 
o = sup—", B = sup—” 
yar > 54 

T I 


and let y be the smaller of «, B. Then 


1/a 


ay < {hntap}!-(S ax)” ”(S 64) 


Since y < 1, the inequality (ii) is evidently sharper than (9.2). 








th 


18 
ca 


f 











ON INEQUALITIES 145 
Proof. Suppose that y = « < B; then 


8) 


2) 
2 nb nn => ( (az) ua’ (62) Up’ (a? b4)!-A 


“ 


oO oO oO ba 
where § = 3 aP a <ySarng< a> ey. 
T T - — 2 


This proves (i) and, since 


10. Owing to the lack of ‘homogeneity in >’ the inequality for 
integrals corresponding to (9.2) is not valid. The following theorem is, 
however, in some respects analogous to (9.2). 


THEOREM 7. If 


b b b 
=[fra)dx, G=[gx)de, — | fr(w)g"(a) dx < «FG, 


b 


then fs f(a)g(x) dx < nA Fuqua, 


Here p, q, A satisfy the usual conditions and the proof of the theorem 
is an obvious application of the methods of this paper. A particular 
case of this theorem is of some interest. It is known [(1) 168] that, if 
f, g are oppositely ordered functions, i.e. {f(2,)—f(+2)}{g(21)—g(*2)} < 0 
for all 2,, x, in (a,b), then 


{ de | f?(a)g%(x) dx < | f?(x) dx | g(x) dx. (10.1) 


a a a a 


This leads at once to the following corollary of Theorem 7. 
3695 .2.2 L 
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Corotuary. If f, g are oppositely ordered functions in (a, b), then 





f F(x)g( a y Purana, (10.2) 


11. The present method of proving inequalities with non-conjugate 
parameters may be generalized to the case of such parameters. 
Let i = 1, 2,..., n and let p,, pj, g;, A satisfy 


(a) Pi > 1, 1/p,+1/p; = 1, 
(5) 2 I/pp 21 


(0) A= Spi, 
(dq) 1/q,=A—1/p; > 0 


It follows from these conditions that 


(11.1) 





ate te +7-+1— —r=1, 
mph a oa es ae 
%i PM 


Hence, 
[As Baen dry We = f(y FP)! Uy Re) FRPP. fA de 
< ({ u, f? dx)'™...( [ U, Ba dex) *"*( [ wf Ps. f Pn dx)”, (11.2) 


where %,..., U,, w are at our disposal subject only to the condition 
Uday, yL!dnwl-rA — k, 


To illustrate this extension of the method, I prove Hilbert’s inequality 
with three non-conjugate parameters. 


THEOREM 8. Let p;, Po, pz satisfy conditions (11.1) with n = 3; let 


I = [ po) dz, G = f omy) dy, i= { home) dz. 
0 0 0 


ee 


where k< (r i) r (a) r (a) \. 
Ap;} \Apo]  \Aps 


Then | | {2 f(xigly = 2) dedydz < kF VGH. Uns, 
0 0 0 
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Proof. With «, 8, y at our disposal, we may write 








r= j es Se *) dedydz 
+y+ 
; i] r [ {Fe (y x) ~¥a(2/x a) ~Be “ (eu y)(2/y) — oda / (x/y) =} os” 
ddd (e-+y+2) (c+y+2)? 
2) -Pas(y/2)-20 








S hPr( (z)(x a 2( 3(2 —A 
x= aa " {f?2(a)g?2(y)h?s(z)}'- dadydz 


1/ 1 1 1-A 
< HeBelyel : 





I, —_ [ H f?1(x)g?2(y)h?>(z) dadydz _ FGH, 
0 0 
= ff £etex My 2) —Yn(z ‘«)- —Ba 
a ‘| ee ee 
00 0 
{ rr U-Yuy- —Ba 
- { f(x) du | | wee net dudv = FJ, say. 


0 0 0 


Setting w+-v = &,v = ” we find 





c é- vau( 1 — ny) YE Bary) —Bar 
J,=|\d d 
| 3 (1+ it 

£1-Ba-yn 

af +e ariel -Pas(1—)-2% dy 


0 

= B(2—fq,—y% Bay +791) B(1—Ba, 1—yu) 

= T(8q,+ya,)P(1—Bq)P(l—ya). 
Similarly J, = GJ, and J; = HJ3, where 

Jp = T(ydot-ag2) 0 (1—yq2)P'(1—age), 

Js = P'(ags+fq3)P(1—ags)P'(1—f95). 
Now «a, 8, y are at our disposal, and setting 

1 1 1 

“laa "la *~ le 

and observing that 


<(e ~) = 5 a 
A\d2 | Gs) Ap Ags APs 
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1 l 
L=h=h=T(K \r(ee)P (gg) = 7 sv: 
Ap, Ape Aps y 


We have now proved that 
I < (J, FY" ( Jy G)¥42( J, H)/99( FGH)'- = JAF UiGurrH Ps, 





we obtain 


12. It is well known [(1) 234] that Hilbert’s theorem for series can 
be stated in the sharpened form 


< Am Dn a[o_»\?(< ~ Up’ ed 
D>, mtn < 705 (>, a8) Z Pie (12.1) 


In the special case p = p’ = 2, it was proved by Levin (3) that the wl 
kernel 1/(m-+-n—1) on the left-hand side can be replaced by the larger 


kernel 
(m ++ << 2)m+n—-2(m, +n)m+n 


(m+n—1)%m+n-1) > (12.2) 





log 


without other change of the inequality. Levin’s proof, however, is 
complicated and does not reveal the simple nature of this result: that it 
is a direct consequence of the truth of Hilbert’s theorem for integrals. 

By deducing Levin’s result from the integral theorem we can obtain 
it with complete generality, i.e. for all p in the conjugate case and a qT 
similar sharpening in the non-conjugate case. The result appears in a ok 
more interesting form if we expand (12.2) in descending powers of 
m-+-n—1, when it becomes evident that (12.2) is greater than 











1/(m+-n—1). > 
In fact, writing m+-n—1 = t, we find m: 
(m+n—2)"t+"-2(m+-n)m+ ee, ig 
log (m-+-n— 1)%m+n-1) = (t—1)log(1—1/t)+ (¢+ 1)log(1+-1/t) 
1 1 1 F 

ak as aad OO eee (123) | 5 
THEOREM 9. di 
z 2, etenits Repay eee} th 


m=1n=1 


= = 1/p 
< 7m cosec— +) az, 
PNG 
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Proof. It is well-known (case A = 1 of Theorem 1) that 


F EHOW) aeiy < x coneo|  to10 axl?l [orn axl” 
JJ (e-+9) way S — [7 (x) dx Jo (y) y 


In (12.4) let us make the specialization 





f(a :) = An (m m—l< apa (12.5) 
gy) =6, (n—l<gy<n) F 
Then 
0 0 a Ip {| = 1/ 
Bd S Knndindy < meosee” (> an) (> vn’ ”, (12.6) 
nr n= I 
where 





od ( dady 
Re, _ ees aty 


m-1in-1 
m 





flog(x+ n)— log(a-+-n—1)} dx 


m—1 
= (m+n)log(m-+n)—2(m+n—1)log(m+n—1)+ 
+(m-+n—2)log(m+n—2). (12.7) 


Theorem 9 now follows from (12.3); and by a closely similar proof we 
obtain in the general, non-conjugate case. 


THEOREM 10. 


oe | 2 A(A+1) 2 A(A+1)(A+2)(A+3) 
>. > ea nt TH (mtn—P 6! (m+n—1)4 


«(com So)" 


13. Proofs that the constant mcosec(z/p), in the conjugate case of 
Hilbert’s theorem, is best-possible are known [(1) 232]; but I have not 
discovered in the literature the rather simple proof given below. 


Set f(z) = aeMire-#, gly) = ye-r'ev; 
then, with e > 0, 





+.J 


m=1n=1 
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Hence 
={ pote aah”| f eran anh” = 22,2) (1) (2) 140 
eat if (x) | g”’(y) dy oe r (e) 
(13.1) 
Also 
r r f(x)gty) 4 d <f _— €-1)/p’px+y) dady 
IJ (x+y) ties J J v ; (x+y) 


to) 1 


- | dé | Ele (| — 9 )(e-Wip gle y(e-Wi'e-€ dn, 
0 0 


where x+y = &, y= &n, 


oe) 1 
PEs | &€-1e-€ dé | neip’+ip-1(] — »)</P+ip'-1 dn 
0 0 


' ¢ =e 
= I(e)Bi-+—, —+-]}. 13.2 
(e) (o+5 = +s) (13.2) 


That 7 cosec(z/p) is the best-possible constant now follows at once on 
comparing (13.1) and (13.2) and letting « > 0. 
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ON SYMMETRIC RECURRENT TENSORS OF 
THE SECOND ORDER 


By E. M. PATTERSON (Sheffield) 


[Received 12 April 1950] 

1. Introduction 
In this paper, several general theorems are established for Riemannian 
spaces V,, which admit a second-order symmetric recurrent tensor 7; 
other than a scalar multiple of the metric tensor g,;;. The tensor Tj; 
satisfies the equation ae 
for some vector x;. Spaces of positive-definite metric admitting such a 
tensor are familiar, and it is known that they are reducible.+ This paper 
is therefore concerned mainly with spaces of indefinite metric. These 
are not necessarily reducible; it is proved, however, that a space 
admitting a second-order symmetric recurrent tensor is reducible if the 
matrix (7'}) has at least two distinct latent roots at any point of the 
space. 

It is shown that the existence of the tensor 7;; implies that there is 
a sequence of second-order symmetric recurrent tensors 7{?), corre- 
sponding to powers of the matrix (7'}). Each tensor 7}, being recurrent, 
generates a parallel plane II,. Sums and intersections of the parallel 
planes II, and their conjugates give rise to further sets of parallel planes. 
It is found that the characters (dimensionality and nullity{) of all these 
parallel planes are completely decermined by the elementary divisors 
of the pencil 7;;—pg;; at any point of the space. 

The Riemannian spaces considered in this paper are assumed to be 
covered by a single coordinate system. The components of the funda- 
mental tensor g;; possess derivatives up to any required order. 


2. Recurrent tensors and parallel planes 
A second-order covariant tensor 7;; (i, 7 = 1,..., n) in V, is recurrent 
ij,p Satisfies 


Teen T= Kg Te 


ij,p pi 


if its covariant derivative T; 
(2.1) 
for some vector field «,. In particular, «, may be identically zero, in 


+ See, for example, L. P. Eisenhart, Trans. American Math. Soc. 25 (1923), 
297-306. . 
t By ‘nullity 


Quart. J. Math. Oxford (2), 2 (1951), 151-8 


5) 


is meant the number of null vectors in a normal basis. 
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which case the tensor 7;; will be called evanescent. The fundamental 
tensor of V, is evanescent. A V, admitting a second-order symmetric 
recurrent tensor other than a scalar multiple of g,; will be denoted by 7, 

A set of p independent vector fields A’, (« = 1,..., p) defined over any 
V,, forms a basis for a parallel p-plane, i.e. & parallel field of p-dimensional 


vector-spaces, if i, = AB, Xi, (2.2) 


for some vectors A8;. Parallel planes have been discussed by A. G. 
Walker} and H. 8. Ruse;t{ some of their results are quoted here. 

Let II be a parallel p-plane, and suppose that the vectors Ai, (a = 1.,..., p) 
form a normal basis (i.e. a set of mutually orthogonal vectors which are 
unit when non-null). If r of these vectors, say ri (p = 1,..., 7), are null, 
and p—r are non-null; then the vectors Xj form a basis for a wholly null 
parallel r-plane. This is called the null pari of II, and is denoted by II*. 
Another parallel plane determined by II is the conjugate (n—p)-plane II’, 
a covariant basis for which consists of the n—p independent solutions 


of the equations pe at 


+ 


If II, and II, are both parallel planes, then their sum II,+II, and 
intersection II, n Il, are also parallel planes. In particular I1* = II n Il’. 

Suppose now that the V, is a 7. If the symmetric recurrent tensor 
T,; is of rank p (i.e. the matrix (7;;) is of rank p), there are n—p linearly 
independent solutions of the equations 


TT; = 0. (2.3) 


Because of (2.1), the solutions A‘ form a parallel (n—,)-plane, which is 
said to be associated with T;;. The conjugate of the associated parallel 
plane is called the parallel plane generated by T;,;. 

If the independent solutions of the equations (2.3) can be chosen so 
that they are all non-null and mutually orthogonal, the space admits 
a non-null parallel plane. It is then reducible; i.e. there is a coordinate 
system in terms of which the metric is§$ 


ds* = g g(x”) dx*daP+-gy ,(2”) da\dat 
(a, B, y = 1,..., p; A, p, v = p+l...., 2). (2.4) 


This is always the case when the metric is positive-definite and 7;; real. 


+ Quart. J. of Math. (Oxford) 20 (1949) 135-45. 
t Ibid. 20 (1949) 218-34. 
§ Eisenhart, loc. cit. 
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In a space 7,, any tensor 7't}) of the sequence defined by 


n> 
Te+D) — TOT, TY =T, (v=1, 2...) 


tj 
is a second-order symmetric recurrent tensor, satisfying 
Ti), = Ky TY). 
The process of forming the sequence is equivalent to taking powers of 
the matrix (7'). 

Since 7'\) is recurrent, it generates a parallel plane II,, When the 
parallel plane generated by 7;; is non-null, each tensor 7’) generates the 
same parallel plane. This is not true in the more general case of a 
partially null parallel plane, as will be shown below. 


3. The elementary divisors of T;,;—pg;; 


In § 4 it will be shown that the dimensionality and nullity of the 
parallel planes II, in a space Z, are determined by the number and 
nature of the elementary divisors to base p of T;;—pg,;. Before doing 
this, we show that the elementary divisors at any point are multiples 
of the elementary divisors at any other point at which 7;; does not 


vanish. This enables us to restrict attention to a single point of the 


space. 
Let 6% aes = ],...,) be an ennuple of vectors parallel along a curve C 
in a space 7, . The ennuple components of g;; and 7;; with respect to 


6, are deli by 
Ing = = 9ij 05,9, T.g = = T;; 95, 9h. (3.1) 
Differentiating the second of these along C, we get 
T py, du* ds = T,; ,(da*/ds)6;, , 


a 


and hence, since 7;; is recurrent, 


dT ,,/ds = oT, 


where ¢ = x; da‘/ds. Therefore (Tg) at any point Q of C is propor- 
tional to (7',g) at an arbitrary point P. In a similar manner it can be 
shown that (g,) is constant along C. Hence the elementary divisors 
of T;;—pg;; at any point are multiples of the elementary divisors at any 


other point at which 7;; does not vanish. 


Since |T5—p9i3| = |9ral| T— pi |, 


' ij 


the characteristic roots of T;; are the same as the latent roots of (7%), 
and the elementary divisors of T;;— pg,; are the same as those of T'i— pi. 
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Walker} has shown that, if all the characteristic roots of 7;; are non- 
zero, then there exists a symmetric evanescent tensor of the form 
a(T7,;— pg;;), Where « is a scalar and p is a characteristic root, so that the 
rank of the tensor is less than n. Hence any 7, admits a recurrent 
tensor S;, such that (S}) has at least one zero latent root. 

Denote the matrix (74) at some point of the space by 7’. Then there 
exists a non-singular matrix % such that ¢%-!7'¢ is a matrix of the 
classical Jordan canonical form, i.e. 


ei +) . 4 


E= , a= 
a. ee ee 1 


where a dot denotes a zero matrix or element. The square matrix C, is 
of order o and corresponds to an elementary divisor p’. The matrix U 
corresponds to all the non-zero latent roots, so that |U| ~ 0. Since T 
has the same elementary divisors as C, it follows that the elementary 
divisors of 7,,—pg;; at any point of the space can be written down when 
the form of C is known. 


4. The parallel planes II, 

THEOREM 1. The dimensionality of the parallel plane II, associated 
with T,; is equal to the number of elementary divisors to base p of T;;— pj, 
and the nullity of Il, is equal to the number of non-simple elementary 
divisors. 

In order to prove this, we require the following two lemmas. The first 
is familiar, and follows at once from the form of the canonical matrix C. 
The second is probably well known, but a proof is outlined here because 
most standard works do not include one. 

Lemma 1. Jf s is the number of elementary divisors to base p of T —pl, 
then T is of rank n—s. 

Lemma 2. In the vector space defined by the solutions of the equations 
T,; i = 0, a unit vector of a normal basis corresponds to a simple elemen- 
tary divisor of T;;—pg;;, and a null vector corresponds to a non-simple 
elementary divisor. 

Proof of Lemma 2. Let the symmetric matrices (g;;) and (7;;) at a 
particular point be denoted by G and H respectively. Then there exists 
a non-singular matrix P such that J = P’GP and K = P’HP forma 


t Loc. cit., Theorem 7.1. 
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canonical pair} of matrices, with 


_. 
i eo taf. & 
ee ae a , 
oe eo Gang 
ee ; 


The sub-matrices J,,, K,, are each square matrices of order o,, and 
correspond to an elementary divisor p. If o, is even, the last sub- 
matrices in J,, and K,, are respectively 


(; ') and (+1). 
If o, is odd, they are 
(+1) and (; '} 


In matrix form, equations (2.3) become HA = 0. If yp is a solution 
of Ku = 0, then the corresponding solution of equations (2.3) isA = Pu. 
Since 

Ap GA, = pa PGP uy = we I py 

for any two solutions p, and p, of Ku = 0, it follows that a set of 
solutions p44, [g;---) gs--» Such that 

HaIpg=9 («AB), pep, = 9,41 (4.1) 
corresponds to a normal basis Ai, (a = 1, 2,...) for the vector space 
defined by the solutions of the equations 7;;/ = 0. The vectors p, 
satisfying u,Ju, = 0 correspond to null vectors of the normal basis, 
and those satisfying u,, Ju, = +1 correspond to unit vectors. 

Define p44, fg,... by 

p, = (1, 0, 0,...), 
ga, = (0, 6, @...., 1, ©,...}, 
where the non-zero element occurs in the position 1+0,+0,+...+0,_}. 
Then the vectors u, satisfy conditions (4.1), with np, Ju, = 0ifo, ~ 1, 
and wu, Ju, = +1ifo, = 1. Hence to each simple elementary divisor 
+t Bromwich, Quadratic forms and their classification by means of invariant 
factors (Cambridge Tract No. 3). It is possible to use these canonical matrices 


instead of the Jordan canonical form to prove all the theorems of the present 
paper. 
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to base p corresponds a unit vector of a normal basis for the solutions tl 
of the equations 7;;)/ = 0, and to each non-simple elementary divisor 0 
corresponds a null vector. The lemma is therefore proved. 

Returning now to the theorem, it follows from Lemma | that there 


are 8 independent solutions of the equations 7;;)/ = 0, and therefore ti 
the dimensionality of IT, is s. From Lemma 2, the nullity of IT} is equal 0. 
to the number of non-simple elementary divisors to base p. Theorem 1 d 


is therefore proved. 
THEOREM 2. The parallel plane II}, associated with T'}) is of dimen- 
sionality p,—p, and nullity p*, where 
Po = Cg41t 2g 42+ Bly gt ores 
Pe = Cost 2gigt-+-+O(Cgg+lsg+a +++)» . 
and e, is the number of elementary divisors p°. 








This theorem is proved by finding the elementary divisors to base p a 

of T—pI from those of 7'—pI, and the following lemma is required. 
Lemma 3. Each elementary divisor p? of T — pI gives rise to q elementary t] 
divisors p?+! and v—q elementary divisors p? of T— pI, where o = pv+q p 
and 0<q<-v. b 
Since C’ = 4-17, the elementary divisors of 7” are the same as tl 
those of C’. In general, the latter is not a classical canonical matrix, ¥ 
but its elementary divisors to base p can be obtained by a suitable le 
permutation of the columns and the same permutation of the rows. E 
We have tl 
f ycolumns o—v columns } P 

CY 1 
- ics : 
Cv = 4 , = st 
C, ‘ : 1 te 
U" T 
b 
; te 
Then there exists a matrix x such that 
| Ge. tl 
° Cosa 
*, 5 
x7C¢x = ° ° Cpu ae F 

.. ; s 
- C, | it 
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there being q sub-matrices C,,, and v—q sub-matrices C,, where 
a= pv+q. It follows that there are g elementary divisors p?+! and 
yv—q elementary divisors p?, and the lemma is proved. 

Using Theorem 1, the dimensionality of II}, is the number of elemen- 
tary divisors to base p of 7'})—pg;;, and the nullity of IT}, is the number 
of these which are non-simple. Let e, be the number of elementary 
divisors p° of 7;;—pg;;. From Lemma 3, there are 


€;+2¢,+ 3e,+...+v(e,+6,4:+...) = Po—Py 
elementary divisors to base p of 7't})—pg;;, and of these 
C41 t 2ly pet. +P(Coy t+ Coys t---) = Dp 
are non-simple. Theorem 2 is therefore proved. 


THEOREM 3. If the matrix (Tj) has at least two distinct latent roots at 
any point, then the space J, is reducible. 


Let p, and p, be two distinct latent roots of 7. If T is non-singular, 
then there is a tensor a(7;;—p,g,;) which has at least two distinct roots 
p= 0, p = a(p.—p,). If 7 is singular, take one of the roots p,, pz to 
be zero. Let m be the greatest index of an elementary divisor to base p; 
then e, = 0ifo > m. From Theorem 2, the parallel plane IT}, associated 
with 7\” is of dimensionality py and nullity zero. If there is a zero 
latent root, py, # 0; and, if there is a non-zero latent root, py ~ n. 
Hence the space 7, admits a non-null parallel y-plane which is neither 
the 0-plane nor the tangent »-plane; it is therefore reducible, being the 
product of a V,, and a V,_,,. 

By a suitable choice of coordinates, in terms of which the metric takes 
the form given by (2.4), it can be shown that the space V,, admits a 
second-order symmetric recurrent tensor which has no non-zero charac- 
teristic roots. When all the characteristic roots are zero, py = 1; 
Theorem 2 can then be expressed in terms of the parallel plane generated 
by 7 as follows: When all the characteristic roots of T;; are zero, the 
tensor T'\) generates a parallel plane of dimensionality p, and nullity p¥. 

This parallel plane is wholly null if p, = p*. From the definitions of 
these numbers, this is true when v > 4m. 


5. Other parallel planes in a 7,. 

The sums and intersections of the various parallel planes IJ,, Il,,... 
and their null parts are also parallel planes, all of which are contained 
in II,. In particular, the planes ©, = IIf n II¥* are parallel. 


v 
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THEOREM 4, T'he null part of the parallel plane generated by T,; contains 
a sequence of null parallel planes Xy, Xsz,..., Up—, such that X,> X,,,. 
The dimensionality of X, is ¢,4,;+€,19+... and X,,_, = IIF,_}. 
If A‘ e X,,,, then 
Ty¥=0, THW=0,  gyrpi= 
for every vector ‘ of IT),,,. Multiplying the first of these equations by 
Tyg, we get 


THA = 0, 
and, since any solution of 7')é = 0 satisfies T+ VE) = 0, 
9:3 XE) = 0 


for every vector &‘ of II'. Hence A‘ € II¥, and therefore A‘ € &,, ie. L,,, 
is contained in &,. 

To prove the second part of the theorem, we use the canonical form C. 
Consider an elementary divisor p’ (o > 1) of T,;—pg,;. The solution 
of C,A = 0 may be taken to be (1, 0, 0,...).. This vector is also a solution 
of CZA = 0, and by permuting the rows and columns of C® as in the 
proof of Lemma 3, it can be seen that A corresponds to a null vector 
of IT}, if and only if o > v. Hence the number of null vectors in a basis 
for II} which also occur in II¥* is e,+e,,,+.... In particular, for 


v=m—l, 


Xn-1 is of e,, dimensions; since &,,,_, c I1*,_,, and II*_, is of e,, dimen- 
sions, it follows that &,,_, = I},_,. 

I should like to thank Professor H. S. Ruse and Professor A. G. 
Walker for their many helpful suggestions. 
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NOTE ON A THEOREM BY J. H. C. WHITEHEAD 
By W. H. COCKCROFT (Aberdeen) 


[Received 21 April 1950] 


1. Introduction 
Let K be a connected, C.W. complex,f with n-section K". In § 16 of (1), 
J. H. C. Whitehead proved that the second relative homotopy group 
m,(K*, K*) is a free crossed (7,(K1),d)-modulet, where 

d: m(K?, K1) > 2,(K?) 
is the boundary homomorphism into the fundamental group of K’. 
The object of this note is to give a proof of this result which avoids the 
geometrical arguments, involving knot theory, which are used in (2). 


2. Definitions 

Let K? = K1 vu {e3}, where {e%} is a set of 2-cells bounded by oriented 
circuits in Kt and not meeting K? or each other in any point internal 
to any of them. Let 

Po = 72(K?, K?), Pi = 7(K") 

with a common base point 2) € K!, and, as above, let d: p. > p, be the 
boundary homomorphism. Let C, = H,(K®, K) be the second relative 
homology group, with integral coefficients, of the universal covering 
complex K of K. Let hg: py->C, be the homomorphism, defined in 
(2) 480, which is onto C, and whose kernel is the commutator subgroup 
of po. 


3. THEOREM. p, is a free crossed (p,,d)-module. 

Let a\ € pp be the element which is defined by a characteristic map 
for e}, joined to x by a path in K!. We define a free crossed (p,, ¢)- 
module h,, with a basis {a}, and with ¢a, = da}. It follows from 
Lemma 2 of (2) §2 that the correspondence a, >a determines an 
operator homomorphism, 6: h, > pg, such that ¢ = d@. Obviously @ is 
onto po. 

Let c, € C, be the element which is represented by a characteristic 
map for é?, which is a cell in K which covers e%. The set {cy} is a natural 
basis for C,, which is a free p,-module, where fp, = p,/dp, ~ 7,(K?). 
The homomorphism h,: p, > C, carries a) onto c, and is an operator 

+ For definition: see (1) § 5. 
t For definition see (2) 455, § 2. 

Quart. J. Math. Oxford (2), 2 (1951), 159-60 
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homomorphism in the sense that h,(xa) = Zh,(a), for all ae p, and 
x€EC,, where Z is the co-set of p, containing x. The correspondencg 
a, > ¢, determines a similar operator homomorphism h: h, > C,, onte 
5, with h.@ = h. 
I now prove that @ is an isomorphism. Suppose 6a = 0, then déa = 0, 
therefore da = 0. Also h, 6a = 0, therefore ha = 0. Hence, by Lemma] 
of (2), a = 0 and @ is an isomorphism. 
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